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ao. UNQ230 a)NA3443,.nT7, U.. «^ „cC ^ . ATCC d.po... no. ATCC 209399. 

Analyse of -^c anuno add «,u„cc of m.^r^ PR0263 polypcpue= s^nes. ±3, poruoc. of . 
pos.«. s,gnnca« ho:»,iog>. ,o CD« a.dg=n. .hereby pR02« may b= a „ov=, „U ,.racc adh.s:oc 

inolccdic. 

EXAMPLE ^4 : holatior nf rnMi^ ri^ .^ '^ n -O'ti'- pp rp^^ 

A cons=n^ DNA s=,a=:.. as«mbl.d reladv. ,o th. od.=r idcctifid EST .«u.nc« a. described i. 
E«n,pl= 1 above. When, the co:.o.u. seance wa. desi^aa^d bereb a. DNA35712. Based on e>e DNA357I2 
coasensus sequent, ol«onucieondes we„ synthesized: >, to identify by PCR a cDNA libraty that =ontai«d tte 
sequence of interest, and 2) for use as probes to tsoiate a clone of th. fuU-lengtb codin, sequence for PRO270 

Forward and reverse PCR primers were synthesized: 
W^PCPw -(.fl) 5 -CCITGGATATrCGCATGGGCCTAC-3' (SEQ ID NO-208) 
WdPCRnrim - r (.fl) 5 -TGGAGACAATATCCCTCAGG.3' (SEQ ID NO 209) 
itv;r5ePrRprirrrrfr11 5 -AACAGrrGCCCACAGCATGGCAGG-3- (SEQ ID NO-210) 
Additionally, a synthetic oUgonuCeotide hybridizauon probe was constructed ftom the consensus DNA35712 
15 sequence which had the following nucleotide sequence 
hvhririi>arinT^ jjf^^f 

5-CCATTGATGAGGAACTAGAACGGGACAAGAGGGTCACTrGGATrCTGCAG.3- 
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^ - b-btarics for a source of a Ml-length clone. DNA front the libraries was screened 

by PCR arnphficabon with the PCR pruner pair identified above. A posiUve library was then used to isoUte Cones 
encodutg the PRO270 gene using the probe oUgonucIeotide and one of the PCR primers 

RNA for construction of the cDNA libraries was isolated from hnn^ fetal hmg tissue. DNA«^ 

UNQ237. DNA395.C.I181, (SEQ ID NO:206) and the derived proxin sequence for PRO270 

n °'^^"'°->"'' « '"0- » F^ure 75 (SEQ ID NO:206) 

Clone UNQ237(DNA3«l.U81)contau.asin.e open reading frarnewtthan apparent 

a, nudeon. pcsttions 3-5 and e,^, a, the stpp codon a. nucleonde pcsiuons 891-893 (Hg. 75: SEQ ID NO:206) 

d!o7 d T ™ ' ™<^' (I'NA395.0.n8n ^ been 

deposited wnh ATCC and is assigned ATCC deposit no, ATCC 209392 

^7°^,^^'='^«^--f^«^->»8*PR0270suggest..hatportionsofi,p<,sses^ 

A consensus DNA set^ence was assembled relative to other EST set^nces using phrap as described in 
^1 ™.conse„sussequenceishereindcsignatedDNA35737. Based on the DKA35737 co^e^ 

«^ence. ol.gonucleondes were synthestzed: 1, to idendfy by PCR a cDNA Ubrary U.t contained the sequence of 
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miercsi. and 2) for use a. probes to isoiatt a clone of the Ml-leng- coding sequence for PR0271. 

Forward and reverse PGR pnmers were synthesized: 
fon^'^rt! PCR rrir - I 5'-TGCTrCGCrACTGCCCTC-3' (SEQ ID N0.214) 

forward pcH pnrrr 7 s -ttcccttgtgggttggag-B' (SEQ id no:2I5) 

fOP^ar;t PCR prirrr- . 5 -AGGGCTGGAAGCCAGTTC-3' (SEQIDNO:216) 

r?v;r?e PCP prim r r ] 5 -AGCCAGTGAGGAAATGCG-3' (SEQ id N0:217) 

r=v,r»PrR Pri T n? 5MGTCCAAAGTACACACACCTGAGG-3' {SEQIDN0 218) 

Addu^cnally. a synthetic oUgonucieonde h>-bndizaaon probe wa. cons^cted from the consensus DNA35737 

sciiucnce whjch had the foUowing imdeoiide sequence 

hvbridiTarinn pmtv 

5 -GATGCCACGATCGCCAAGGTGGGACAGCTCTrTGCCGCCTGG.^G-3' (SEQ ID NO-2I9) 

In crocr ,o screen several libraries for a .ouree of a full-length clone, DNA from *e libraries wa. .creene.. 
by PCR a^^pltf^on ^ .he PO. p™.r pair idc..fi=d above. A pc.hlve library wa. ,hcn .ed «, «c.a,e clone, 
enccdmg the PR0271 gene ming .he probe oUgonndeonde and one of ttie PCR primers. 

RKA for consxmcdon of fte cDNA Ubnxies was isolated from hunian feal brain DNA««x™ 

UNQ238 (DNA39.23-,182), (SEQ m N-0:212, and ^ derived protein for PR0271 ' ■ 

ne enttre mt^leonde «.p^e of UNQ238 (DNA39423.n82) is shown in Figure 77 (SEQ ID NO-2I2) 
Ci™ UNQ238 (D.NA3,.23-, ,82) cor^ , ^„ „ ^^^^ .^^^ 

a, oucleo^de posinons 10M03 and ending a. .he s.op e«lon a. nnc.eo.ide posiuons ,181-1153 (Pignre 77) Tie 
pred-c^ polvpeptide precursor is 360 amino acids long (Figure 78). Clone UNQ238 (DNA39423-1182) has been 
deposited with ATCC and is assigned ATCC deposit no. ATCC 209387. 

Analysis of the amino acid settee of ^ M-length PR027. polypeptide suggests tha, i, possess 
signtfican. hom^togy .o the pro.oglycan hn. protein ^ ^, p^^,, 1 may be a Bn. pro.ein homolog. 

A consensus DNA sequence was assembled relative u> EST sequetxe. using phrap as described in 
E^le. above. consensus sequence is. herein designa.ed DNA36460, Based on the DNA36460 c^sensus 

mterest. and 2) for use as probes «. isolate a clone of the fi.U-length coding sequence for PR0272 
Forward and reverse PCR primers were synthesized: 

for^n PCB prim - r (.fi) 5 -cgcaggccctcatggccagg-3' (seq id no 222) 

toam^nms: (.O) 5'-GAAATCCTGGGTAATrGG-3' (SEQ ID NO-223) 

5'-GTGCGCGGTGCTCACAGCTCATC-3- (SEQ JD NO:224, 
AddtfonaLy. a synU^etic oligonucleotide hybridizahon probe was constructed from the consensus DNA36460 
sequence wfch had the following nucleotide sequence 

5 -CCCCCCTGAGCGACGCTCCCCCATGATGACGCCCACGGGAACTrc.3' (SEQ W NO:225, 
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fa ordrr :c „v„^ fo, , 50,:r== of a f^l-lcnph cioar. DNA from .he Iibnm« w« screati 

PCR ^c^n with *c PCR phnrr pa^s id=n:ifi.d abov.. A po«.vc Lbrary wa. to rneC .0 «oU„ clones 
cncoflrag the PRO:72 gzrK using the probe oligonucleotide and one of the PCR primers. 

RNA for construction of the cDNA libnnes was isolated from human fetal lung nssue. DNAajr^f 
Of the clones isolated as described above gave the Ml-lcngth DNA sequence for PR0272 [herein designated as 
5 UNQ239 (DNA40620-U83)1 (SEQ ID NO:220) and the derived protein sequence for PR0272. 

The entire nucleotide sequence of UNQ239 (DNA40620.1 183) is shown in Figure 79 <SEQ ID NO-220) 
aot« UNQ239 (DNA40620-n83) contair. , single open reading teie with an apparent translational initiation site 
at nudeoude positions 35-37 and ending at the stop codon a, nucleotide positions 1019-1021 (Figure 79). Tie 
predicted polypeptide prea«or is 328 ammo acids long {Figure 80). Qone UNQ239 (DNA4O620-11S3) has been 
1 0 deposited with ATCC and is assigned ATCC deposit iro. ATCC 209388. 

Analysis of the amino acid sequence of th^ full-length PR0272 polypeptide suggests that portions of it 
possess sigrtficant homology ,0 the human and mouse renculocalbin proteins, respectively, thereby mdicating that 
PR0272 may be a novel rcticulocalbm protein. 

A consensus DNA sequence was assembled relative to other EST sequences using phr.p as described in 
Example I above, ms consensus sequence . herein designated DNA35731. Based on the DNA35731 consensus 
se^ence. oligotiucleoudes were synthesized: ,) to identify by PCR a cDNA library that eon^d the secf^e of 
interest, and 2) for use as probes to isolate a clone of the full-length coding sequence for PR0294. 

20 Forward and reverse PCR primers were synthesized: 

fopvardPCKprimT C-fl) S'.TGGTCrCGCACACCGATCO' (SEQ ID NO:228) 

f^rvvafd PCR pnnirr (J2} S'-CTGCTGTCCACAGGGGAG-B' (SEQ ID NO -229) 

toniM£mss^(.m 5'-CCrrGAAGCATACTGCTC-3' (SEQ ID NO:230) 

ronv^dPCRpriTrrr Cf4) 5'-GAGATAGCAATTTCCGCC-3' (SEQ ID NO:231) 

25 reverse pCFpn m ^ (rl) S'-TTCCTCAAGAGGGCAGCCO' (SEQ ID NO-232) 

reverse PCf^pnirrx c,r2) 5'-CTTGGCACCAATGTCCGAGAnTC-3' 

(SEQ ID NO:233) 

Add^^onally. a synthe.c oligonucleotide hybridization probe was constmc.d from the consensus DNA35731 
sequence which had the following nucleotide sequence 
30 hvbridigarinn p^^ti^ 

5'-GCTCTGAGGAAGGTGACGCGCGGGGCCTCCGAACCCTrGGCCTTG-3' 
(SEQ ID NO:234) 

order .0 saeen several libraries for a source of a Ml-Ieng* done. DNA from the Hbraries was screened 
•^PCR^onwiththePCRprin.erp.irsidennaedabove. A positive library was then used » isolate clones 
35 encoding the PR0294 gene using the probe oligonucleotide and one of the PCR primers 

RNA for construction of the cDNA Ubrarie, was Uolated from htnn^t fetal brain tissue. DNA.=a=r™ 
Of Oie Clones isolated as described above gave the fuil-.ength DNA sequence for PR0294 Piereia designated as 
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UNQ257 (DNA4060^1 187)] (SEQ ID NO:226) and dcrivrd proeb sr,u«xc fo: PR0294. 

Tbc cr.dr= a-^i=ouci= sequence of UNQ257 (DNA40604-n87) U shown b Fip:« 81 (SEQ ID NO 226) 

nu=,.=tid= posinon^ 396-398 a.d cndmg « the .op codon a. nucl=oud= posiacns 2046-2048 (F,|u„ 81) T>- 
p.d,c:=d po.yp=p.d= p^cursor . 550 anabo ac.d. Ions (F.8ur= 82). Cioc. UNQ257 (DNA40604-n87) ^ been 
deposited with ATCC and is assigned ATCC deposii no. 209394. 

Analysts of the an-^o acid .c<T^c^ of .he M-length PR0294 polypeptide suggests that portions of it 
possess significant homology to pontons of variot. coIUgcn proteins, thereby indicating ,l.t PR0294 tr.y be 
coUagrn-lixc molecule. 



A consent DNA a^sen^lcd rcladve to o^r EST sequence. u.mg phrap a. described in 

W I above. 1^. c„ sequence is .ere. ee« DNA35814. Based on the DNA35814 co.en.us 
.c^cr^e. chgonncleondes ... sy.rhe.zed: I) . identify by PCR a cDNA library ..t contai:«d the sequence of 
tnteres:, and 2) for use as probes lo isolate a clone of ±e full-length coding sequence for PR0295 
1 5 Forward and reverse PCR princn were symhesizcd: 

rorvard PCR Primer en ) 5--GCAGAGCGGAGATGCAGCGGCTTG-3' 
fSEQ ID NO;238) 

fPTOd PCR rnrn r T (.f2) 5-<:CCAGCATGTACTGCCAG-3' (SEQ ID NO-239) 

forward PCR r .mrv( G) s'-ttggcagcttcatggagg-s' (seq m no-240) 

20 forward FCP pn^^r (.f4) 5--CCTGGGCAAAAATGCAAC.3' (SEQIDNO-241) 

rTvcr ^ cPCKwr (.rl) 5'.CTCCAGCTCCTGGCGCACCTCCTC-3' (SEQ ID NO 242) 
Addn^onally, a syo^et:c oHgonucleoride hybrid^uon probe was constructed fro. the consensus DNA35814 
sequence which had the following nucleotide sequence 
hvbndizatinn prn}); 



25 



5-GGCTCTCAGCTACCGCGCAGGAGCGAGGCCACCCTCAATGAGATG-3- 

(SEQ ID NO:243) 



bv PCR 7 -ouree Of 3 fiiU-lcngth clone. DNA front the libt^es was screened 

-"^-StcPROiSSgeneusingtheproheoligonucIeotideandoneofthePCRpritne. 

RNA for constmction of the CDNA libraries was isolated from htcnaa fetal lung ti«ne DNAseoorin. 

n K cv INU.ZJ5) and the derived protein sequence for PR0295 
r- , (DNA38268-n88) is shown in Rgnre 83 (SEQ ID NO-235, 

Tat " " ""^ (ONA38268-m8, hi been 

deposed wtth ATCC and is assigned ATCC deposit no, 20W21. 
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Analysis of th= acid sc,ur.c= of th= t^,.-^± PR0295 polypcpfd. sugg„u to poniom of., 
possess signifies., homology ,o th.= uucgrm prou-:.s, Lh.r=-^ ,ndic«ung toPR0295 n-^y be , ao^.d inicgria, 

EXAMPLE 39 : Isolaiior of .nv;^ ri^ ff '. F■^r■^rt,^|, H ii- ^i m PP "-'"^ 

sc=rctcd proKms from the Swiss-Pro, publ. proteu: d.ubas= we:e used » search expressed sequence ..g (ESl^ 
d...hases. The EST da^ _ ^^^^ ^ ^ .^^ ^ 

(UFESEQ™. Incy. Pb»»ceudc.Is, P.,o AUo, CA). Tie search «as perfonned usmg .he con,pu.er prognn, 
Bl^ or BI^T2 (Altshul ec al., M.lh^is i. F T TV^^Wr 2S5:46(M80 (.99«)) as a coa,,arisoc of te ECD pro^ir, 
sequences to a 6 frame uanslation of d.c EST sequence. Those comparisons resulung in a BUST score cf 70 (or 
m some cases 90) or greater that did not encode la>own proteins were clustered and assembled into consemus DNA 
sequences wiu. the program ^hrap" (Phil Green, Umversiry of Washing„,n, Seattle, Washit^ton: 
http:/^o2emaIl.mbl.washmgton.edu/phrap.docJ/ph^ap.htInl). 

Based on an expression u.g sequence designated hereir, as T08294 identified in the above analysis 
oltgot^leoddes were synthesued: 1) to identify by PCR a cDNA library that comained .he sequer^e of int„«t and 
2) for use as probes to isolaie a clone of die full-letigth coding sequence for PR0293. 

A pair of PCR primen (forward and reverse) were synthesized: 
fcrwamPCKprimrr 5'-AACAAC}GTAAGATC3CCATCCT0.3' (SEQ ID N0:246) 
rfv;r;;FCPffriTi r r 5'-AAACTrGTCGATGGAGACCAGCTC-3' (SEQ ID NO:247) 

Additionally, a synthenc oligocuckotide hybridization probe was consmiciB. fr„,^ ,u. . 

' J™'* w=o'"'™aed from the expression sequence tag which 

/u nad die following nucleotide sequence 

hvbridi7aiini1 prnh- 

5^AGGOGCTGCAAAGCCTGGAGAGCCTCTCCTTCTATGACAACCAGC-3' (SEQ ID NO-248) 

'""^""'"^--"-^^^'^•^-'ourceofamU-lengtiic.one.DNAfrom.helibrarieswasscreene^ 
b>.pcRamp,m^^withthePCRprunerpair.dentifiedabove. A positive library was ti.cn used to isolate clor^s 
2. encoduig the PR0293 gene using ti.e probe oligotnsdeotide and one of ti« PCR primers 

RNA for const^ction of the cDNA libraries was isolated 6om human fetal brain tissue. DNAset^ 

UNQ2i6 (DNA37151-1I93)] (SEQ ID NO ^44) and die derived protein sequence for PR0293. 

The cnrirc nucleotide sequence of UN0256 mVA-iTiSt 1lQl^ ■ ^■ 

an ^ -1 ^ is shown m Figures 85A-B fSEO m 

30 ^ :244,. Clone 0NQ2.. (DNA37151.„93) contains a single open reading .ame wi. anTpparen. tralZ 
— .tear^leotide positions SSl-883 and 

::.r9 

aeposiurd widi ATCC and is assigned ATCC deposit no. ATCC 209393 

^'•""°''^'^°-'''~ofti«Ml-,engfl>PR0293po,ypepti^ 

i5 possess sign:f.canr homology to th* NLRR nrnr^ir,. u ■ ■ uuns or ii 

gy ih. NLRR protcm., thereby mdicaung ihai PR0293 may be a novel NLRR proteiit 
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A consensus DNA sequence was assembled rcUnve to other EST sequences usbg phrap as described m 
E^^le I above. "Hus consensus sequence is herein designated DNA33480. Based cn the DNA33480 consen^^ 
scq^-n.e. oli.cnucleoudes were synthesized: I) to identtfy by PCR a cDNa Horary tt.: contaned the sequence of 
interest. arJ 2) for use as probes to isolate a clone of the fuJI-icngth codmg sequence for PR0247, 
5 A pair of PCR primers (forward and reverse) were synLhesired: 

forward PCR VUXrx 5 -CAACAATGAGGGCACCAAGC-3' (SEQ ID S 0:251} 

r^vrrsr pcF rrimrr s -gatggctaggttctggaggttctg-s' (Seq id no-252) 

Addmonally. a synth.uc oUgormcleot^dc hybnduanon probe was constructed from the DNA33480 expression 
sequence tag which had the foUowing nudeoiidc sequence 

10 hybridization ^rn^7 

5 -CAACCTGCAGGAGArrGACCTCAAGGACAACAACCTCAAGACCATCG-3' (SEQ ID NO:253) 

In order >o screen scve,^ libr^ . , ^ ^^^^^ ^^^^ 

by PCR PCX ^ ^ . 

encodmg the PR0247 ge« usmg chc probe oUgonucleoudc one of Ihe PCR primen. 

RNA for coi^micdon of U>e cDNA liberies was .sola^d from ^ feaj b,.i„ dssuc. DNA™« 
of ..e cone, isoU.d . descHbed above gave ^ m-l^ DNA se^^e for PR0247 n^rein de«g»u^ 
UNQ221 (DNA35673-1201)] (SEQ ID NO:249) ^ U>e derived pro«in «q.enee for PR0247 

T>= e-tire oucleoLde .e^^nee of UNQ22I (DNA35673-1201) is shown in Figures 89A-B (SEQ ID 
NO:249, aone UKQ22I ,DNA35673.,20,) contains a singie open readir^ frame wid. an appar™ ™„Ia,iona, 
^nanon s,. a. ™.,eo.dc pcsidons 80-82 of SEQ ID NO:249 and ending a. Ce stop codon after nt^Ieond. position 
1717 of SEQ ID NO:249 (Hgttres 89A-B,. 7.e pre^oed pol^tide precursor is 546 an™, acids long (Figure 88, 
Co. 1-N.Q22I (DNA35673.I2C1) bas been deposited wttb ATCC ^ is assigned ATCC deposit no. 209418 

Analyse of tbe an»o acid setptence of the fbll-iengtb PR0247 polypeptide suggests that portions of it 
possess significant homobgy to the densm molecule and KTA Amu i. • ^- • 

m moiccuie and KIAA0231 . thereby ladicatmg Oiai PR0247 may be a novel 

J leucine rich repeat protein. 
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consensus DNA sequences were as™ relative to other EST sequences .in, phrap as described in 
Example 1 above. T^ese consensu, sequences are hereu: designated DNA35953. DNA35955 DNA35958 
DNA3...dD™3.BasedontheD.A35.3cons_ 

01 Jie tull-lcngth coding sequence for PRO302. 

PCR primers (forward and reverse) were synthesUed: 
forward PCK Orini ^ r 1 5--CTCCGCAAGGATGCCrACATGTrC-3> (SEQ ID N0.264, 
forw a rd PCR Prim rr 7 S'-GCAGAGGTGTCTAAGGTTGO' (SEQ id NO-263) 
reverie prp prt rr r 5'.AGCTCTAGACCAATGCCAGCTrCC-3' (SEQIDN0266) 
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^so, a s>..v.uc chgc^lcodd. hybnd^aoo probe ccosrructcd from ihc consensu. DNA35953 wruch 
had the foUowing nucIcoLide sequcnc; 
hybridization prnhr 

5 -GCCACCAACTCCTGCAAG,^^CrrCTCAGAACTGCCCCTGGTCATG.3- (SEQ ID NO:267) 

In orttr » ,cr«n «v=n] Bbx^c. for a souxo. of . M.lc:,g* clon=, DNA ftom U-.t libnri« was scirensd 
by PGR anplifictio^ wia, -^^ PGR prii^- p.i,, i^^aficd .bovc^ A positive Ubrary w« then used ,o i,=h=. dOBCs 
encoding ih= PRO302 gent using tht proie oligonuclMiidc and one of the PGR primrs. 

RNA for consmicaon of the cDNA Ubraiies was isolated from hmnan feol Iddney issue (UB228). 
DNA sequencing of *= do,« isola-^ as described above gave Uie fiiU-lcngth DNA sequence for PRO302 
Ihrre-x designated as UNQ265 (DNA4(r37(VI217)] (SEQ ID NO:234) and th. derived protein sequence for PRO302. 

Tne entire nucleodde sequence of UNQ265 (DNA40370-12I7) is shown in Figure 89 (SEQ ID NO 254) 
Clone UNQ265 (DNA4037(.1217) contains a sutgL- open reading frame wi* an apparent translational utinanon site 
a, t^cleoude positions 3^36 and ending at the stop codon at nucleotide positions 1390-1392 (Figure 89) He 
predicted polypeptide p^otrsor is 452 atnino acids long (Figure 90). Various unique aspects of the PR03(K protein 
are shown m Figure 90. Clone UNQ265 PNA40370-1217) has been dcposiud with the ATCC on Novetnber 21 . 
15 I bi> / and !S assigned ATCC deposit no. ATCC 209485. 

Based on the DNA35955 consensus sequence, oligonucleotides were synthesized: 1) to identify by PGR a 
CD.NA library that contained the sequence of interest, and 2, for use as probes to isolate a clone of the full-length 
coding sequence for PRO303. 

A pair of PCR primers (forward and reverse) were synthesized: 
20 forward PCP rnnn S '-GGGGAATTCACCCTATGACATTGCC-S- (SEQ id N0:268) 
rcveTSC PCR pnmrr 5'-GAATGCCCTGCAAGCATCAACrGG-3' (SEQ ID NO-269) 
Addiuonally. a synthenc oliganucIeo.de hybn^zadon probe wa. construaed from the coxuensus DNA35955 
sequence which had the following nucleoiide sequence: 
hvhHrii7annr n^nh^ 

5 J'-CCACCTGTCACCTACACTAAACACATCCAGCCCATCTGTCTCCAGGCCTC-S' (SEQ ID NO.270) 

^""^"-----"ibrar^forasourceofahUl-.eng.hclone.DNAfrom.helibtarieswasscreened 
PGR an^lUication ,^.h the PCR ^ pairs identified above. A positive library was then used «, isolate clones 
encoding the PRO303 gene using the probe oUgouucleotide and one of the PCR primos. 

RNA for constnicnon of the cDNA libraries was Uolated from human fetal hmg tissue (UB25) 
DNA sequencing of the Cones isolated as described above gave the ^.-length DNA sequence for PRO303 
[here, designated as UNQ266 ,DNA4255M2.7), (SEQ n> NO:256) and the derived protein sequence for PRO303 
nie ennre mideotide sequence of UNQ26S (DNA4255I-1217) is shown in Figure 91 (SEQ ID NOi256) 
CO. ..Q2« (DNA42551.1217) conta^ a smgle open reading .an. With an a^re^ 

a. nndeoude Potions 2<.22 and ending a. the stt^codon a, nucleotide positions ^^^^^ 

po)>pepnde precursor is 314 amino acids lone (FimiT* 921 Varin,,. , ■ 

. ^. ^ ™g ^Hgur. 92). Vanous unique aspects of the PRO303 protein arc shown 

m Figure 92. Clone UNQ266 (DNA4255M2lTi >«. >^.« h . ^ snown 

Kur^MZ2:>i 1217) has been deposited on November 21. 1997 with the ATCC and is 
assigned ATCC deposit no. ATCC 209483. 
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Ea^cd on DNA35W8 com^u. sequence, ol.gonuc,«rid« wee „ pcR , 

ccdxg sequence fc: PRO304. 

Pa:rs of PCR pnincrs (foru'ard and reverse) were synthesized: 
foru-ardPCRp^n - r 5'-GCGGAAGGGCAGAATGGGACTCCAAG-3- (SEQ ID NO 27J) 
5 f9rw,rriprp pn m^ r? 5 -CAGCCCTGCCACATGTGC-3' (SEQ id N0:272) 
forw^rj PCP nrimr^ . 5-.TACTGGGTGGTCAGCAAC-3' (SEQ id N0:273) 
levers: PCR Pri ir n 5'-GGCGAAGAGCAGGGTGAGACCCCG-3' (SEQIDN0 274) 

Addu^cnaliy. . .ynmedc oHgonuchonde hybndizadon probe wa. constructed from ^ consensu. DNA35958 
sequcuce which had the foUowing nucleoude sequence 
10 hybridization prn>;>p 

5 .GCCCTCATCCTCTCTGGCAAATGCAGTTACACCCCGGAGCCCGAC.3' (SEQ m N0.275) 
'""'^^•"'"--"■-'^''■^'^^--of.ftUI-.cngthdo^.DNAfron,^ 

cncod^g PRO304 ^ u.tog a,c probe oUgo^c.=oud= a:„i on. of .he PCR pr™„ 

RNA fc „f ^ ,„NA Ub™« .0^^ fro„ ^ ^^^3, 

aNA 0,^ „^ ^ ^^^^ ^ ^ ^ ^^^^ 

rn^ dc.^.d . UNQ257 (DNA39520-I2.7), (SEQ ID NO:258) U.c derived p,o«i„ „^ pRO304 
Tl.= «.T= nucleotide se<p:eoce of UNQ267 (DNA39320.12I7) h .hown fa, F,gure 93 (SEQ ID NO 258) 
ao.U.Q2.(DNA3..0.,2,7)co:,^...,,^„.^^,^,^^_^^J^^^^^^ 
nucecode po^uions 34-36 3.0 endu^ . ^e ™p codon at nucleoude pcsiaons 1702.,7<M (Figure 93) n,e 
pre.c=d poi^ptide preo,r,or is 556 an^i™ acd. ,„„g (p.g„, variou, uui^ ..pec. of the H,O304 protefa. 
are .bow. . P.g... ^. cone UNQ267 (DNA39520.12I7) ba, been deported wiu, ATCC Co Novercber 2. ,997 
and IS assigned ATCC deposit no. ATCC 209482. 

Based on the DNA37,60 cot^ensus sequence. oUgonuCeotides «re syntbe^d: 1) to identify by PCR a 
.DNA ,.tao- .bat contained .be sequence of invest, and 2) for use as probes to isolate a Cone oftfjl-iengtb 
=odmg sequence for PRO307. ^ 

Pairs of PCR pnmers (forward and reverse) were synthesized- 
forward Prp p n ^ ^ r I S'-GGGCAGGGATTCCAGGGCTCC-S' (SEQ ID NO-276) 
fopv^rrtPrpprw r- S-GGCTATGACAGCAGGTrC-B' (SEQ ID N0377) 

ferwarrt pfP primr^ s'-tgacaatgaccgaccaggo' (seq m no-278) 

I'L"" 7 ^" ^ " ^ ^■■<3CATCGCATTGCTGGTAGAGCAAG.3' (SEQ id no 279) 

A^dutonally a syn.edc oUgotn^ieotide hybridt^bon prol« was constnreted front .be consensus DNA37,60 
sequctce whtcb had the following mscleotide sequence -JNAJTieo 

5-TTACAGTaccCCCrGOAAACCCACTTGGCCTGCATACCGCCrCCC-3.(SEQIDNO:280) 

cn the PCR pruner patrs tder^fi^ ^ p,,,,,. ^ . ^^^^ 
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encoding uie PRO307 gene usin? the probe oligonuclcoiicle and one of the PCR prin:trs. 

RNA fo: comt-ucdcn of me cDNA hbrancj was isolated fron human fetal liver dssuc (UB229}. 
DNA secr^ncuig cf ihe c:on^ isolarcd a5 described abov= gave th: fulMcngih DNA sequence for PRO307 
rncrem designated as UNQ270 (DNA4:225-12I7)] fSEQ ID NO:260) and the derived protem sequence for PR03Cn. 

T7i= enure nucleotide sequence of UNQ270 (DNA41225-1217) is shown m Figure 95 (SEQ ID NO-260) 
Cione L^Q270 (DNA4 122542 17) coman. a single open reading frame w:th an apparenr oansladon^ ^uaxion sue 
at nucleotide positions 92-94 and ending at the codon at nucleotide posinons 1241-1243 (Figure 95). "me 
predicted polypeptide precursor is 383 amino acids long (Figure 96). Various unique aspects of the PRO307 protein 
are sho^ in Figure 96. Clone UNQ270 (DNA4 1225-121 7) has been deposited with ATCC on November 21. 1997 
and IS assigned ATCC deposit no. ATCC 209491. 

Based on the DNA30895 consensus sequence, oligonucleotides were synthesized: 1) to idennf>- by PCR a 
CDNA library- that contained the sequence of mterest. and 2) for use as prt^bes to isolate a clone of the full-length 
coding ssquence for PR0343, 

A pair of PCR primers (fonvard and reverse) were symlicsizcd: 

fnrwarHFrRnriUfr 5--CGTCTCGAOCGCTCCATACAGTrcCCTrGCCCCA-3' (SEQIDNO:281) 
15 reverse PCR primi-t 

5-TGGAGGGGGAGCGGGATGCTTGTCTGGGCGACTCCGGGGGCC 
CCCTCATGTGCCAGGTGGA-3- (SEQ ID NO:282) 

Additionally, a synftetic oligonucieotide hybridization probe was commuted from the con«nsus DNA30895 
sequence which had the foUowins nucleotide sequence 
20 hvbridi7annn nrnh^ 

5 -CCCTCAGACCCTGCAGAAGCTGAAGGTrCCTATCATCGAC 

T-GGAAGTCTGCAGCCATCTGTACTGGCGGGGAGCAGGACAGGGACCCATCACTGAGGACATGCTCT 
GTGCCGGCTACT-3' (SEQ ID NO:283) 

to order to screen several Ubranes fora source of a Ml-lengd. clone, DNA from the libraries was screened 
by PCR ampiificanoo „>m the PCR pntr.r pairs idemif^d above. A posinve library was then used to isolate clones 
encoding the PR0343 gene using the probe oligonudeoude and one of the PCR printers. 

RiNA for cocstrucDon of the cDNA libraries was isolated from human fetal hmg tissue (UB26). 
DNA se<juencing of the clones isolated as described above gave the M-length DNA sequence for PR0343 
[hereut designated as UNQ302 (DNA43318-1217)] (SEQ ID NO:262) and the derived protein sequence for PR0343 
The entire nucleotide sequence of UNQ302 (DNA433I8.ni7) is shown in Figure »7 (SEQ ID NO-262) 
Cior. UNQ302 (DNA43318-.2.7) contains a smg.e open readmg frame with an apparent transUtional initiation site 
at nucleodde posinons 53-55 and ending a. the stop codon a. nucleoude pos.Uotis 1004-1<X)6 (Figure 97) 
predicted polypepdde preoirsot is 317 amLio acids long (Figure 98). Various unique aspects of the PR0343 protein 
are showninHgure 98. Clone UNQ302 (DNA43318-.217, has been deposited with ATCC on November 21 1997 
J :> and 15 assigned ATCC deposit no. ATCC 20948 1 . 
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E XAMP LE ■«2 : hoiarion of ^n^^A ^-| ^..^ Frn'iirp H-imn PP ^^?° 

A concensus DNA .c,.=:^ EST usi^g phmp d=.cr:bci m 

Exan,'= 1 abov. u,, „ ^ ^^^^^^^^ 

s-.^.cn.=, ohgonucl^cndcs wcr. synt.„.=d: I) ,o :«nufy by PCR a cDNA l,b:a,y Uut „d L,. of 
m,««:, ans 2) for u,c ^ probes .o bolaic a cior« of ^ fulHengm cod>r.g ,=q.cr« for PR0328 
J Forward and reverse PCR primers were symhesued; 

f^-ardPCRrnmrr 5--TCCTGCAGTrTCCTGATGC.3' (SEQIDNO-286) 
reverie PCR .nm - r s-ctcatattgcaCACCAGTAATTCG-J' (SEQ id NO-2S7) 
Add,„onalIy. a synthedc oligo^ieoude hybrid«a™„ probe w« consrrucred frorr. d,c come>«us DNA35615 
sequence which had Lhe following nucleotide sequence 
10 M'bridi7atior prn"^ 

5-ATGAGGAGAAACGTrrGATGGTGGAGCTGCACAACCTCTACCGGG-3' 
(SEQ ID NO:288) 

h order u, screen several librane« for a source of a fcll-lengfl, clone. DNA from the libraries was screened 
_ by PCR aroplincanon with *e PCR prtmer pair idenufied above. A po^rdve library was men used ,o isola« clor^s 
X. er^od^ the PR0328 gene using me probe oligonucleotide and one of U« PCR primers. 

RNA for construction of the cDNA libraries Was isolated from human fetal kidney tisme 
DNA sequencing of the dorrs isolated as described above gave me fuil-lengm DNA sequence for PR0328 
Ihereu, des.gnated as UNQ289 (DNA40587.n3I)] (SEQ ID NO:284) and the derived protein «^nce for PR0328 
The enure nucleotide sequence of UNQ289 (DNA405B7-123I) is shown in Ftgure 99 (SEQ ID NO-284) 
10 Clone UNQ289 (DKA.05S7-I231, contains a single open readmg f,ame wim an apparent translatioaal initsauon stte 
a. r.uc,eo.de posidons 15-17 and ending at the stop codon at m^eotide positions I4O4-1406 (Figure 99) Tlte 
preicted poiypepdde preo^sor is 463 ammo acids long (Figure 100). Clone UNQ289 (DNA40587-1231) has been 
deposited wtth ATCC and is assigrad ATCC deposit no. ATCC 209438. 

Analysis of the amino acid sequence of .he full-lengm PR032a polypeptide s.rgges,s that portions of it 
5 possess s,gnifi=ant homology ,o the human ghoblastoma protein and to the cysteme nch secretory protein hereby 
.nd,=a.i-,g that PR0328 may be a novel ghoblastoma proteir. or cysteine rich secretory protein. 

Emm^: bflUi i on nf rm, rinnr. f^..^. p p^ ^„ pp„„, „^ ^ ^^^^^ 

A conse,^™ DNA sequence was assembled relative u, other EST sequences using phrap as described in 
E^mple 1 aoove. This consensus sequence is herein designated DNA36685. Based on me DNA36685 consensus 
sequence, and Incyte EST sequence no. 2228990. oligonucleotides were sycmesized: 1) to identify by PCR a cDNA 
horary that cont^ed me sequence of interest, and 2) for use as probea to isola. a clone o, ti>e fUll-lengU, codtng 
scouenc: for PR0335. PR0331 or PR0326. 

Forward and reverse PCR primers were symhesized for the detenninadon of PR0335- 
WdPCRp n T r r 5 -GGAACCGAATCTCAGCTA-3- (SEQ ID NO-295) 

MPCRr rimn 5--CCTAAACTGAACTGGACCA-3- (SEQ id NO-296) 

ferwaid prp m n^i 5--GGCTGGAGACACTGAACCr-3- (SEQ ID NO 291) 
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for^rd PCRT?r i mgr 5'-ACAGCTGCACAGCTCAGA.ACAGTG-3' (SEQ id K0:298; 

r'v?r,.; FCR Trnmr 5 -CArrCCCAGTATAAAAAmrC-S ■ (SEQ id N0:299) 

r:v;rsg PCR prn ir r 5'-gggtcttggtgaatgagg-3' cseq id no:3oo) 

r?vcrsg PCR mx:,: 5 .GTGCCTCTCGG'rrACCACCA.^TGG-3' (SEQ ID NO:301) 

Add:nor3lly. a smhcnc ongoi^.ondc hybntoon prob= wa. coru^^t^ for ih= d=:.n^aon of PR0335 which 
5 had the following nuclsoiidc sequence 
hybridization P^P^f 

5 ■-GCGGCCACTGrTGGACCGAACTGTAACCAAGGGAGAAACAGCCGTCCTAC-3 ' 
CSEQ ID NO:302) 

Forward acd reverse PCR primers were symhcsized for ihe dctennmadon of PR033 1 : 
10 for^gr j PCR priTTir - 5'^CCTTTGACAACCTTCAGTCACTAGTGG-3' (SEQ ID NO:303) 
r^verge PCR Primer 5 '-CCCCATGTGTCCATGACTGTTCCC-3- (SEQ ID N0:3W) 

AddiiionaUy. z syr^ ohgomdecndc hybndizauon probe wa. con^mictcd for the drt^rmmation of PR033I which 
had the following nucleotide sequence 
hvbridizaTion prnn^ 

1 i ^ -^^-^-TGCCTCATGACCTCTTCACTCCC7TGCATCATCT^AGAGCGG-3• 
'S£Q ID NO:305) 

Forward and reverse PCR primers were synthesized for the dctenninarion of PR0326: 
ronvar^f PCR vnmr 5 '-ACTCCAAGGAAATCGGATCCGTTC-3* cseq id NO:306) 
river s ? PCK Primf 7 ^'-TrAGCAGCTGAGGATGGGCACAAC-S' (SEQ ID NO:307) 

AddinonaUy. a sy..ihedc ohgonucleonde hybridization probe was constructed for the determination of PR033I which 
had the following cucleoticc sequence 
hvbridi2 
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5'-GCCTTCACTGGmGGATGCATTGGAGCATCTAGACCTGAGTGACAACGC-3- 

(SEQ ID NO:308) 



In order to screen severaJ hbr^nes for a source of a fuU4ength clone. DNA from the libraries was screened 
by PCR amphfcanon wrm ^ PCR pri^. p,rs identified above. A positive library wa. then used to folate clones 
encod:ng PR0335, PR0331 or PR0326 gene using the probe oligonucleotide and one of the PCR primers. 

RNA for constr^cuon of the cDNA hbr;mes was isolated from human fetal kidney tissue (PR0335 and 
PR0326) and human fctaJ brain (PR0331). 

DNA sequencing of U,c doces isok^d d«crib=d above gave He M-length DNA ,eq,.eoce for PR0335 
PR0331 or PR0326 [herem designattd a, SEQ ID NOS:289, 291 ami 293, respectively; «= Fi|m« 103A.B 105 
and 107. respectively), and ±c derived pro«m sequence for PR0335. PR0331 or PR0326 (see Figures IO4' 106 
ar.d 108. respectively; SEQ ID NOS:290. 292 and 294, respectively). 

Tte ennre nndeonde sequences are shown in Figures 103A.B, 105 ^ 107, deposited with the ATCC on 
35 June 2, 1998. November 7. 1997 and Noven^er 21, 1997, respectively. 

Analysis of the amino acid sequence of the Ml-length PR0335, PR0331 or PR0326 polypeptide suggests 
that porfons of „ possess sigmfieant homology ,0 the UG-1 protein, thereby indicating that PR0335. PR0331 and 
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PR0326 may be a novel UG-1 -related proic:n. 
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Based upon an BCD hcmclog>- search pcrfcnned as dwcri'Dcd in Examph 1 above, a consensu. DNA 
sequence designated herem a. DNA36688 was assembled Based on the DNA36688 consensu seque-.= 
5 oh^omcleotidcs were s>^:zed .denufy by PCK a cDNA library ih.: conuuned the sequence of mteresr and fc: 
use as probes to isolaic a clone of the full-length coding sequence for PR0332. 

A pair of PCR primers (forward and reverse) were symhesizcd: 
5 -GCATTGGCCGCGAGACTrTGCC-3- (SEQ ID N0:31IJ 

5 -GCGGCCACGGTCCTrGGAAATG-3 (seq ID NO:312) 

10 A probe was also synthesized: 

5'-TGGAGGAGCTCAACCTCAGCTACAACCGCATCACCAGCCCACAGG.3' 
(SEQ ID NO :3 13) 

Ln onla xo «r«n severe honric. for . „>„c= of a m-\.^^ done. DNA from Ubrari« wa. «™c<d 
oy PCR ^lificazion whh .be PCR p,, id.™if„d abov. A pciuve Ubrary w« u.=d u. isolat. clo«s 

11 t.^odmg the PR0332 gene usmg Uic probe oIigonucl=otidt and one of .he PCR primers. 

RNA for consirxtion of Cie cDNA Ubiari« was bolaied from a hmnan fetal liver libiarV (LIB229) 

DNA seqaeaor^g of me clone, i.olared as described above gave M-lengrb DNA sequence for 
DNA40982-1235 and the derived protein scqoe^e for PR0332. 

TTk eimre nucleodde sequence of DNA40982-1235 is shown in Figures 109A-B (SEQ ID NO 309) Clone 
20 DNA40982-1235 contains a s^glc open reading frame (with an apparent translattonal ininadon site a. nucleotide 
posmons 342-344, as indicated in F.^es 109A.B). TTte predicted polypeptide precursor is 642 amino adds long 
and has a c^cnlated tnolccular we,ght of 72.067 (p,: 6.60). Clone DNA40982-1233 has beet, deposited wia. ATCC 
and IS assigned ATCC dcposn no. ATCC 209433. 

Based 0. a Bl^T and FastA se<^nce alignment analy^ of *e Ml-length sequence. PR0332 shows about 
2. 30^0% ammo acid sequence idennty with a series of known proteoglycan se^es, including, for e«mpie 
fbromoduhn and ftbromoduim precursor sequences of various spcctes (FMOD BOVIN. FMOD CHICK FMOD 
R..T, FMOD MOUSE, FMOD HUMAN. P. R36773), osTot^dulin seqt^nccs (ABOOOIM 1. AB<»784g 1)' decorin 
se^es (CFU8314U. OCU033H.. P. R42266. P R42267. P.R42260. P.R89439), Iceratan sulfate proteoglycans 
(B™ 360_1. AF0a2890_l,. corneal proteoglycan (AF022256_1), and bone/carti^ge p Jg,yoal\nd 
30 proieoglycane precursors {PGSl.BOVIN. PGS2. MOUSE, PGS2.HUMAN). 

EMMELE4i: to i aMn of cdna cinn.. F„.r,^i^r Hnmnn r un-i 

A consensus DNA sequence was assembled relative to other EST sequences using phrap as described in 
E,amp e 1 above. Based on t..e consensu, sequence, oligonucleotides were synthesized. 1) .„ identify by PCR a 
- . CDN A hbrary that contamed the sequence of .terest. and 2) for use as probes to .solate a clone of the full-length 

coding sequence for PR0334. 
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Forward ana r.v=.= PCR ^ ^^^^^ ^^^^^^^^ 

M Kf. Pn n r - ^'-GATGGTTCCTGCTCAAGTGCCCTG-a'CSEQIDNO a.e) 
"'^^"'"^ ' '" ^^ ^ ^--TTGCAOTGTAGGACCCACGTACGO (SEQ ID NO SIT; 

Adciconally, a symhruc oligorrjciroad; hvbridizatior probe wa.. c.nn*m,r. ^ f . 
Had the following nuci=onde sequenci 
5 hvbridizarinr> y^p^??^ 

5-CTCATCGGAGGACCTGTGTAGATGTrGATGAATCTGCTACAGGAAGACCC-3' 
(SEQIDNO:318) 

cncod^g PR0334 .^g ^ prob. oU,™.„„,id= on. of ^ PCR pri^„ 
H.^ I^^al cDNA Ubra.« „ ^ .j,,^ ^ ^ 
_o:aI.y available r^gcnB such as mo« ftcm Invirrogc, Saa Diego ^ ^ ""^ 

DNA ^q^rrir^ „f ^ clonn „olau-d as described above gave *e fulI-ln^* DNA secuecce for PR0334 
^re. des...d as DNA.3...3, (SE, ,D KO.M) derived pro^se^ ^Z^ 

csE..r:t;etz:™Z""^ 

(Figme 110). Clone DNA41379.1236 has been 
depos.ttd w,d> ATCC and is assigned ATCC depos., no. ATCC 209488 

Analysis of *e anuno acid sequence of d,e fuH-ieng* PR0334 p„,ypep,de suggest ,ba, portions of i, 
possess significajii tomology to ihe fibulin and fihwiv v is» su mai poraons of ii 

~ Of .,e EGP prc^L,, ^ " — 
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A consensus DNA se^^. ulennficd ph^p ^ ^ ^ ^^^^^ 

coascnsmsccpcnceishersind«igiiacdDNA3824n r«.h w tsv, ^pecincaily. this 

gnaicauNA38240. Based TO rhcDNA38240comciisus sequence oU^onuck^ 
were synchisized: 1 ) to identify by PCR a cDNA lihr. ougonucieotides 
nrnh-. , . ^ ^'CR a cDNA library that comamed the sequence of interest, and 2) for me as 

prab« to isolate a clone of the fuU-icngth PR0346 coding sequence 

R-NAforcor^ncnofthecDNAlibrarieswasuolatedfrcmhu^fctalliver cDNA Hbr. ' . 
to isolated the cDNA clones were constructed by standard method ■ 

Tm^r. c r.- Standard methods using commercially available reacents fe e 

Invitrogen, San Dicgo, CA; Clomech, etc ) Hoc cDNA w., r^rr. ^ v 

1.1 r. ^ o^KO containing a NotI site ImVi-rf with 

blunt to Sail heraildnased adaptors cleaved wirhMnM • . g a ixou site, linked with 

•"■■"""""■'■■Sf-..mii7...ao,i»,„i.»,rt»^,„„„^, 
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pr;:cur£or is 450 amino acidi long. Clone DN'A44167 T^d^t v. ^ ■ 

..... „o. ATCC .0.3. „„„ ™ " '''' 

^ sequence itou:y,„,„,^,^^„^,„„ 3 

OLI2691 f38240.n) ^ 

5--GATCCTGTCACAAAGCCAGTGGTGC-3- (SEQ ID NO:321) 
OU2695 (38240.rl) 

5'-CACTGACAGGGTTCCrCACCCAGG-3- (SEQ ID NO 322) 

OU2692 (38240.pl) 

3 .CrcCCTCrGGaCTGTGGACT.TGTOOGaAACATGACCCTGACArG.3. (SEQ ID K0:3.,, 

, .ovc ™. „ , ^.^^^ ^^^^^^^ ^^^^ ^ J™":": 

Forward and reverse PGR pruntrs were syniteized- 
^^r^miK^ 5 -TGAGGTGGGCAAGCGGCGAAATG-3' (SEQ ID N0.326) 

5 -TATGTGGATCAGGAGGTGCC-3. (SEQ ID ' 
'^CE^ 5-TGCAGGGrTCAGTCTAGATrG-3- (SEQ m NO:328) 
™^ ^■■TrGAAGGACAAAGGCAATCrGCCAC-3- (SEQ ID NO- 329, 

hvbndiTaTinn pmK^ 

S -CCAGTCrrGCAGTTCCCCTOGCACTCCrGGTGCrGrrCCnTGGG.. (SEQ ,D .0.30 
.=oo«g the PR0268 u,u« tie probe oUgom,deo«de and one of fl.e PGR pricer, 

(D.A3.2., n JsE TZlZZT 7 " """" ^-^'^^ " 

^ li^ N0.324) and the demed protem sequence for PR0268 

/pep precursor i. 280 amino acids long (Figure 1 14). Clone UNQ235 rDNA39427 1 HQ. h., 
deposited with ATCC and is aisienerf ATrr a tiJi>iAjy427-] 179) has been 

^ 'UJQ IS aisigncd ATCC deposn no. ATCC 209395. 
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Analysis of the 3^ .cid secu.nc= of ^Mcngth PR0268 polyp^ndc sugg«. tiur h possess 
s»: ho™icg>- . p:o:^ ^^^^ ^^^^^^ ^ ^^^^^ ^^^^^^ 

isonirras:. 

£XAMPL.B^8 : holanor nf rp^'^ ri^ p.. p^^,,^ 
5 A consensus DNA sequence assr^bled rclanve to other EST secuecces usm, phrap as descnbed ^ 

Ex...le , above. T^s consensus sequence :s herein designated DNA35730. Based on the DNA35730 consensus 
sc^e, oUgonucleoudes were synth«.ed: I) to i,^fy by PCR a cDNA Ubrary ^: contan^d the sequence of 
u:r=r=st, and 2) for use as probes to isolate a clone of thr full-length coding sequence for PRO330. 
Forward and reverse PCR primers were synthesized: 
10 rprwardPri^ vm r r A 5'-CCAGGCACAATITCCAGA.3- (SEQroNO:333) 
forward PCR primrr? 5--GGACCCTTCrGTGTGCCAG-3' (SEQIDKO:334) 
rev>;,e PCR pri fpr r 1 5'-GGTCTCAAGAACTCCTGTC-3' (SEQ ID NO:335) 
r^vrs^PCKT^rmr-? 5'.ACACTCAGCATrGCCTGGTACrrG-3' (SEQ ID NO:336) 

AddinonaDy. a synthetic oligonucleotide hybridization probe was constnirr<=rf fmrr, rh 

^ u«-iuon p.oDc was constructed from the consensus sequence which had 

1 J Jic lollowing nucleotide sequence 

S'^GGCACATGACTGACCTGATTTATGCAGAGAAAGAGCTGCTGCAW (SEQ ro N0.337, 

'""''^^'''--"-■^'ibzan.feasourcecfaM-bngftd^c.DNAfrom.h.librarieswa^s^^ 
by PCR ^pMca^n wim U.. PCR pri™, p.i. ,aeadfi.d A posiUvc li.:^ to ,o i,„^„ c>o«s 

cncod„,E a,c PRO330 g«« usmg probe oligo^clcoud. ard one of the PCR pria,cr, 

RNA for consm^oicn of *e cDNA Ubr^e, wa, isolated ftom 1^ f.^ yver msue. DNAse,^ 
of .he cone. .ola.ed . de^ribed above gave d. M-len,m DNA sequence for PRO330 p^rei. d«igr.„d as 

UNQ290 {DNA40603-1232)](SEQIDNO-3'nandA.H„iv H «igmtea as 

ri wcv "J and ihe derived proiein sequence for PRO330 

-n. ennrenucleodde sequence of UNQ290 (DNA40603-n32) is shown in Fi^-e 1,5 '(SEQ ID NO-33I) 
a<.eUNQ2.(DNA^3.fZ3.,con:ainsasin..e open read., .an. wid-anappare.™^^ 

n.cleonde pos^uons 167-169 and ending a, *e s.„p codon a nucleoode posinons ,76^,768 (Hgure ,15) T^e 
prc.c.d po,« preo^or is 533 amir„ acids long (Pi,.e 1,6). Cone UNQ290 (DKA40603-,232) has been 
cep„.,.d w,* ATCC and is assigned ATCC deposit no.ATCC 209486 on Noven*er 2, ,997 

.Analysr. of d,e an^ acid sequence of the ful,,engrh PRO330 polypepdde suggests .bat portoos of it 
posse s s-gn^ncan, hon..ogy . t^ n^ ^„ ^ ^^^^^ ^ J 

n^.ay be a novel prolyl 4-hydroxyUse alpha subunit polypeptide. 

B^AMEL£4g: ispiatjffn of rPNA none. Pn^.^i^f h„^,„ rn"i|(i 

A consensu DNA sequence was assen^led r^ve to other EST sequet.es using pbrap as described in 
s!2: — — ~3. BasedontheDt:0553consel: 

zr: r " '"^'^ ^ - - — ^f 

mtere,.. and 2) for use as probes to isolate a Cone of the ft.,-lengd, coding sequence for PRO310. 
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Forward and rcv-rse PGR prhrcrs were synthesized; 
I gr>v,:^ PCR rnm.^ 1 5'-TCCCCAAGCCG7TCTAGACGCGG-3- (SEQIDN0 34.. 
f^r.,:jPrP Prin-^ 5.CTGGTTCTTCCTTGCACGo- fSEQIDNO-545) " 

^'-<^CCCAAATGCCCTAAGGCGGTATACCCC-3' <SEQ ID NO-344. 

the foUowmg nucleondc sequence 

r-OC0TCTO.TaCTTC0^OCArmCTaTCCTrro™.TOCTAOCAC.^ 

=nctag a: fte «op codoB a> n-uclcoto posiuon 1035-1037 (Figure 119), Ti. „„dic«. 
pol)Tepadcprccurscr,s 318 am™ acids lone (FiBurenm^Hk ,, ' «^ predicted 

36,38: daltor.. Clone DNA43046-P25 Jh T ' =f .PP-xinutcly 

Analysis of fee amino acid jcqucnc: of iht flill-knirth PROJin 
d=v=lop™n.. """"" '^^"•^^ ""^''O -y b= involved in 
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Ancrpt«scdscq-jcnccia«(ES-nDNAdaabaa(UFESEO-'« in^, 

5..CCCTCATCTACCCOCTCC.3. (SHQ .o'^, ^ ^ 
~m-U 5-.OTGTGACACAGCGTGGGC.3- (SEQIDN0.43, 

^■■CACCGOCAGGCTTCTGCG-3. (SEQn,N0.44 
™^ 5.CAGCAacrrCAGCCACCACGAGTGG-3. (SEQIDNO.S) 
™^ -CTGAGCCCTCGGCrGCAGTCTCG^- (SHp^.o T 
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favbridizaiinn pr^l^ff 

..CCC.CTACO.CTaOrrCnC.TCArcCAaOATOACACATATCToC-3. CSHQ NO.,) 

™-otog PR0339 g=n= usmg U>c probe oUso-mckoudc and ca= of the PCR prim=„ 

w« .cqucnc^d ,r. The c^re nudcoude of DNA434«6 1225 « .h ■ T 

NO:338). Cta DNA4346^,225 co«». a °-^^^34«6.1225 . shown m F.gme 117 (SEQ ID 

Arm A -^^^^^^^-^ *^ ^posited with ATCC and is assiened 

ATCC deposit no. ATCC 209490 «5ignca 

Based on a BLAST and FastAscouritcaTi^r™^. > • . • 
fr^ic U^creby n,d.ca^ ^. PR0339 nuy b= Involved in dcvdcprnent or U^ue grow*. 

20 ,„forPR0244. ' °' """" >"°'«^ -<o„c of ^ M,.,e.,U, coding 

A pa. of PCR priincr. (forward and reverse) were synteized- 
5'-TTCAGCTrCTGGGATCTAGGG.3- (30923.f„ (SEQ ID NO-378) 

5'-TATrCCTACCAT^CACAAATCCG.3. (30923.rl, (SEQ ID NO-STS) 

A probe was also synthesized; 

2S ;-™OCCAC^^^^^^^^ ^^^ ^^^ ^^^^ 

"."^o.g Of .rrirrrrr^^'^"^"^"-^™''----^--- °- 
col: xr—riL'"'" ' ^^^^ ^ °- « 

u.^AJ5663-ll71) and u assigned ATCC deposit no ATCC20937I 'n,.^ u 
cytoplasmic domain (aa 1 -20^ a rr;,n^, u . oo- a i l,czo^7 I . The protein has a 

^ -^^^r « transnjcmbrace domain raa ')t^^^ j 

aomam (aa 2M6J. and an extracellular domain (aa 47-219), with 
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2 C-lcctin domain at aa 55-206. 

B«»d or a BLAST >M FasW «<p«n« al.gm,™ a;^ysis of MMcngth sequeoct. PR0244 shows 
no.bl= am™ aad sequence .denno, to hepauc lecm gallrn gallos (43%). HIC hpl2(^bmdmg C->T,= Icca. (42%) 
macrophage lectin 2 (HUMHML2-1. 41 S), sequence PR32188 (44%) 

The Mowing describes use of a nndeonde seq»« encoding a PRO poiypeodde as a bybndhanon 

probe. 

DN A comprising .he coding sequence of of a PRO polypeptide of inierest as disclosed herein may be 
einployed as a probe or used as a basis ftom which ,o prepare pro'oes .o screen for homologous DNAs (such as drose 
10 enco^ng na«rally^ccurnng variants of the PRO polypeptide, in hnman nssue cDNA hbraries or human tissue 

genomic libraries. 

Hybridizanon and washmg of aters containmg either Ubrary DNAs ,s perfonr^d under the following high 
stnngency conditroos. Hybridization of radiolabeled PRO polypcptide^oding nucleic acid^ved probe to the 
niters IS performed m a solution of 50% fonnamide. 5x SSC. 0.1% SDS, 0.1% sodium pyrophosphate. 50 mM 

1. '-^-P'»»P'».=.PH6.8.2xDe«soluuon.andl0.dexrra.sul^,eat4rCfor20hcurs. Washingofthe 
aiers IS performed in an aqueous solution of O.u SSC and 0.1 % SDS at 42°C. 

DNAs having a desired sequence idenhty with the DNA encoding ftill-length native sequence PRO 
polypeptide can then be identified using standard techniques known in the art. 

20 UAMELBJl: E:toressinn of P^p Pr|. ^„^^ ^ ^ ^^ j^ 

Ihis exaniple illus^es preparation of an ung.ycosyla.ed fbrm of a desired PRO poly^tide by recombinzn, 

expression in E. coli. 

^■'■''^^-^--"^■bedcsitedHRO^lyi.ptideisinm^^^ 
^''''--"''"^''^-i'-'-.ctionenzyn.esi.eswhichcorrespondtotherestriction 

TTrr yl^^. Anexan,,eofa,uitablevcaorispBR322 

(derived from £. .oU; see BcUva, et al.. ^. 2:9S (m7)) which contams genes for ampicillin and tetracycline 
t^,sta..c=. The vector . digested with restricnon enzy.. and dephosphorylated. The PCR ampUfied sequences are 

^nhg^dmtouie vector. The vector Will preferably u^lude sequences Which encode for an anti^otic^ist^ 
gene, a trp promoter, a polyhis leader fmcluding the first si, m codons. polyhis sequence, and entero^e 

30 ----).'^«P«ificPROpo,ypeptidecodingregion.lambda.ranscriptionalte»^^^^ 

The hgation n^mre is then used to transform a selected £. strain using the meU«,ds described in 

co-onies^ethensel^d. P^ONAcan.UcUedand confirmed by restricti^ 

^5 annh T ''°'™ " " '""^ ^^-^<^ with 

— "-^"-'^^''."'--^.ubsequent.ybeused.oinocula.ea.^^ 
grown to a destrcd optical density, .firing which the expression promoter i, turned on 
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o.a.» by cc.^^ga^on ^ ^ . ^ „. ^ 

pc,^,,.« =30 U.,c. -oc p.m=d . ^^^^ 

PP.0IS7. PROan, PRO301, PROZ:. pro23S .c. s.cc«.f„0. „ ^ , 
5 u^=dfon.,.^*efoBowi:,p.occau„. Tl. DNA cncod^ PRO,87, PR03n. PRO30I. PR0224 orrR0238 

10 -"---f--^.-.-s.b..„„,„^,,OV3no^,„„.),„,,^^„,^,^^ 

E. CO), paste from 0.5 L fennc„.n„^ (6-10 g pcUcu) was „su.p«<.„i i„ lo vohnn« (w/v) i:, 7 M 

20 ■"---P™-wi*aHc,.„^.„i..«.o.^^ 

buffer (6 M piaiudinr. 20 mM Tris dH 7 4^ ^ v ™o oi ineai chelate cohmm 

„ , , °™ '•"> fil'"=d throujh 0.22 micron fflter, ,t, clarify. Depending to 

. nfie. «... Ioa.e. . s ^ gu,cn Nl-NTA chelate colu» c^.^c. L l^ZZ 

~ r r "'"^ ™ ^'"'^ - - cca.io.it: 

.a..ula,edjximcLon=oeffieienl based on amino acid sequence 

U.3 M NaCl. 2.5 M urea. 5 mM cysteine. 20 mM glycine and 1 mM EDTA Refoldmc 

a rinaJ conctntradon of 0 47 «^ ai«uuun oi irA to 

a«-ed* K Bcfec further purificadon of me pro«in.U«,ol»don was 

ateredfcough. 0.22 micron filter and acetomtrile was added in 7 ioi« i "i. u« Joluoon was 

was chro^^graphed on a Potos Rl/H r=ve H T ^''O'^ fi-" concentranon. Tl„ refolded protein 

a ^ad,=o. of - , ' ""^ ""^ ' '^'^ °f ™ ''"^cn with 

^ad,= . Of acetonttnle front .0 to 80%. Ali,uou of fracttons with A280 ahsorbance were analyzed on 

P«e.,sandfra.onscon.au.,.„n..eneousrefo,d.pr„.e.we„^ 

^-ded speces of n,o. proteins are Cted at .e lowest cot^entrations of accto^trUe since those UTto 
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most ==:,^a whh d=:r hydropbob. ^on ,m=id.=. from imcr^ction with ihe r.v«cd ph«= x«m Agg„ja„d 
sp=c:« ^rc usually clu«d « higher ,cc,on,uUc co«„«uon.. In .ddiuon » „,„,v^ ..i,f„,ded fom. of pro,ei:» 
tron d«>red fot:n, Uie reversed pha.e srep .ho removes e-Tdou..^ from ^.e saaiples 

Fr»c«ons co.«^ desired folded PR0187, PR0317. PRO301. PR0224 »d PR0238 prctt^ 
respectively, were pooled ^nd ^ ,=e»ni:rii= reeved usir,g . genUe srrezm of m^gec direned „ the solunor, 
5 Pro.^ were fomr^^d .mo 20 rM Hepes. pH 6.8 with 0.14 M sod.™ chloride »d 4% roaimitol by dialysis o- 
by gel filtraao. usrr« G25 Superfine fPtormaci.) resins equilibrated m the fonrmlation buffer and sterile filtered. 

EXAMPLE 54 : Expressi-v, of Pl>n Priy yT-r ffl - T in M»,T,m.l„n 

ms example Ulustrates preparation of a glycosyUted form of a desired PRO polypeptide by recombinant 

lU expression m mammalian cells. 

T>.e vector, pRK5 (see EP 307,247. published March 13. 1989). is employed as the expression vector 
Opttonally. the PRO polypeptide-encoding DNA is hgated into pRK5 ,^.h selected restriction en^ to allow 
,nsert.on of the PRO polypeptide DNA using ligaUon methods such as described in Sambrook et al. sj^a The 
resulting vector is called pRK5-PR0 polypeptide. 

15 In one embodtmem. the selected host cells may be 293 cells. Human 293 cells (AT€C CCL 1573) are 

grown ,c confluence in ussue culnrre plates in medium such as DMEM supplen«„ed with fetal calf serum and 
opnonany.r^tent components and/or antibiotics. About .0 « pRKi-PRO polypeptide DNA is mixed with about 
1 « DNA encodmg the VA RNA gene fThimmappaya e. a,., 31:543 (1982)J and dissolved h. 500 ^ of I mM 
Tris-HQ. 0.1 mMEDTA. 0.227 MaCl,. To thrs mixrure is added, dropwise, 500 .1 of 50 mM HEPES (pH 7 35) 

20 280m,MNaCM.5mMNaPO„andaprecipi,a.eisallowed.ofonnforIOmmutesa,25X. T.e precipice is' 
suspended and added to the 293 cells and allowed to settle for ab.«. four hours a. 3rc. Tl. culture medium is 
aspi^^d Off and 2 ml of 20. glycerol m PBS is added for 30 seconds. Tie 293 cell, are then washed with serum 
tree medtum. fresh medmm is added and the cells are incubated for about 5 days 

'■'-""^^-■--f'caons, the culn^emedhm, is removed and ^^^^ 
2. med.um (alone) or cutoe medium contammg 200 .Ci/ml "S^steine and 200 .Ci/ml "S-methtonine After a 12 
hn. ^.badon. *e ct^thdoned medhm> is collected, concentrated on a spin filter, and loaded onto a 15* SDS ge, 
Tt.e processed gel may be drted and exposed to fta for a selected period of time m reveal the presence of PRO 
po,^epM=, Tie culmres contaming transfected cells may undergo hn^r incubation (in serum free medium) and 
tte medium is tested in srlected bioassays. 

0 '"""='-^^-P«0P0«»maybei„tr0ducedinm293 celktransienrlyusm^ 

'"H^r" ™ " ^^"^ ^-^^^ 0981). 293 cells are grown to 
inaximal density ma spinner flask and 700 ii»rRK-tPBn„i. 

, . . ^'*'"^™'»'yi«P»* DNA 15 added. The cells are first concentrated 

.om^spinnerf.skbycent.i^ationandw,shedwi.hPBS.^^ 

pee. .or four hours. ^'"X' —cd with 20, glycerol for 90 seconds, washed with .issue culture medium 

ZrrT ' ""^ '-'^ 0.1 «/ml 

o .n. transfernn. After ab.t four days, the conditioned media is centrihiged and filtered to remove cell and 
d=br.s T,. sample containmg caressed PRO polypeptide can tiien be cot^nnared and purified by any selected 
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mithod, such as dialysis and/or column chromatography. 

In another cmbocmacm. PRO polypcpiidrs can be expressed in CHO cclU. 1^ pRK5-PR0 pol>pepddc 
can be oansfecied mio CHO cells u.mg known reagents such as CaPO, or DEAE-dewian. As dcscnbed above, the 
ceu cultures can be incubaied, and the medium replaced wi-J, culture medium (alone) or medium comaimng a 
radioiabei such as ^S-meduomne. After derennining the presence of PRO polypeptide, the culture medium may be 
replaced w,ch sen:m ^ medium. Preferably, the cultures are mcubared for about 6 days, and then ±t conditioned 
medium harvested. n:ed:um comainii^g the expressed PRO polypeptide can then be coocemraied and purified 
by any selected method. 

Epuope-agged PRO polypeptide may also be expressed m host CHO cells. The PRO polypeptide may be 
subctoned om of the pRK5 vector. The subclone msen can undergo PCR to fuse in frame with a selected epitope 
such as a poly-has lag into a Baculovirus expression vector. Tic poly-his tagged PRO polypeptide insen can then 
be subcloned mto a SV40 dnven vector containmg a selection marker such as DHFR for selection of stable clones. 
Finally, the CHO cells can be iransfected (as described above) with the SV40 driven vector. Ubehng may be 
performed, as described above, to verify expression. Ue culture medium containing the expressed poly-His tagged 
PRO polypeptide can then be concentrated and purified by any selected method, such as by Ni^'^helate affinity 
chrjr.iiitography. 

?RO211,PRO2n,PRO230.PRO219.PRO245,PRO22I,PRQ258,PRD30I,PRO224 PR0222 PR0234 
PR0229. PR0223. PR0328 PR0332 wc« succ«s<uJly .^r«scd in CHO cell, by bo>h . n^icm »d , «ablc 
cxpr=«iOD procata:. In iddirion, PR0232. PR0265. PR0246, PR0228, PR0227. PRO220 PR0266 PR0269 
PR0287, PR0214, PR0231. PR0233, PR0238, PR0244. PR0235. PR0236. PR0262, PR0239 PRO257" 
PR0260, PR0263. PRO270, PR0271, PR0272. PR0294, PR0295. PR0293. PR0247, PRO303 ^ PR0268 were 
s-jccissfully iiansienay expressed in CHO cells. 

Sublc expression in CHO cells was perforn«d usii« .he following procedure. The proteins were expressed 
as an IgG con^mrc, funmunoadhesin). in which d>e coding sequences for rhe soluble fo.™ (e.g. extracelM^ 
dcma^) of Uie respecnvc pro,eins were fiised .0 an IgOl consum region sequence conuining <be hinge, CH2 and 
CH2 domains and/or is a poly-His tagged form. 

Followuig PCR an^lificanon. the respecuve DNAs were subcloned in a CHO expression vector using 
sandard technques as descri-ped in At^ubel et al.. Curren, PrcccoU ofMol.culcr Biology. Unit 3.26. John Wiley 
and Sons (1997). CHO expression vectors are constructed to have con^atible lestiicdon sites 5' «xl 3' of the DNA 
of oterest to allow the convemem shuttling of cDNAV The vector used express„n in CHO cells is as described 
m Lucas « al.. t^ucl Aciis Zi: 9 (.774-1779 (.996), and uses the SV40 early protnoKr/enhancer .0 drive 
cxpresston of the cDNA of unerest and dihydrofoUte reduc-^e (DHFR). DHRl expression pennits sekcdon for 
siabl: mauitenance of the plasmid following iransfection. 

Tweh,e microgtams of the desued plasnud DNA were mtroduced into approximtely 10 million CHO cells 
ustng con^cially available nansfccdon reagents Superfect' p^gen). Dosper' or Fugene' (Boehrit^er Mannheint). 
^ ens were grown and descnbed in Uscas « al.. supra. Approxinu.e,>. 3x10" cells are frozen in an an.pule for 
fijnhcr gro^-th and production as described below. 
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The amputes comamiog the plasmid DNA were ituwed by pUcemem m» water bith and imxcd by 
vcn:.™g. Tne con^E. were pipeoed mto a ce^iifuje mbe comammg 10 mL. of media ard cemrifuged « 1000 rpm 
for 5 mmcres. The .upemaant was ajp,ra.ed and the were resuspended 10 mL of selecive media (0 2 
aiered Pr^ .-.fl, 556 0.2 Mm diafiliered feal bovine «rum). TT,e celb were men al:quo,ed inm a 100 n-i spime.- 
conaiiimg 90 mL of selecnve media. After 1-2 days, fte cells were iraiisfened m» a 250 mL spmner filled w.d, 
150 mL sekcnve growO, medium a«d mcubated at 37-C. After aether 2-3 days, a 250 mL. 500 mL axu! 2000 mL 
spnmers were seeded with 3 . 10= c=i!s/mL. Tl,e eeU media was exchanged with fresh media by cemifugation and 
resuspension in prodncuon medimn. Although any smtab.e CHO media may be employed, a p,odu=do„ medium 
descnbed inUS Patent No. 5,122.469. issued June 16. 1992 was actually used. 3L production spmner is seeded at 
1 .2 X 10' celis/mL. On day 0. the ceU nmnber pH were determmed. On day 1. the spinner was sampled and 
spaxgng With fflte^d air was commenced. On day 2. d-_- sptm^r was san^led. the temperamre shifted to 33-C and 
30 mL of 500 g.-L glucose and 0.6 mL of 10» antiioam (e.g.. 35% polydimcthyisiloxane emulsion. Dow Conting 
365 Medtcal Grade Emulston). Tlroughou, the production. pH was adjusted as necessary to keep at aromri 7 2 
After 10 days, or until viability dropped below 70S. me cell culmre wa, harvested by cemrifitguon artd filtering 
thrcMgh a 0.22 ^m eter, Ti^ filtrate was etther stored at A'C or immediately loaded onto columns for purification 
For the poly-His tagged constructs, the proteins were purified using a Ni-NTA column (Qugen) Beforr 
r^c3Bon. imidazole added to tite condiuoned media m a concentratton of 5 mM. Htc condtdoncd media was 
pumped onto a 6 ml Ni-NTA column etr.iHbra.ed m 20 mM Hepes. pH 7.4. buffer containing 0.3 M NaCl and 5 mM 
.mtdazole at a flow rate of 4-5 ml/min. at 4'C. After loading, the column was washed with ad<fid»al etpiUbradon 
buffer and the protem eiuted mth etpiUbradon buffer comaining 0.25 M inndazole. Tie Mghly purified protein was 
subsetpemly desalted mto a storage buffer containing 10 mM Hepcs. 0. 14 M NaQ and 4% manmtol, pH 6,8. with 
a 25 ml G25 Superfine (Pharmacia) column and stored at -80'C, 

immunoadhesm (Fc containing) construes of were purified from the conditioned media as foUows TTe 
cotidiocned medium was pumped onto a 5 ml Protein A column (Pharmacia) which had been equilibra^d in 20 mM 
Na phosphate buffer. pH 6.8. After loading, the column was washed extensively with e^-b„tion buffer before 
eluucn with 100 mM cmc acid. pH 3.5. "n. edited protein was imme^aiely neutraltzed by ccUecung 1 ml fractions 
..0 U.-OCS cootamMg 275 ,L of 1 M Trts bufTer, pH 9. Ute highly purified protem was subsetpendy desalted into 
storage buner as described above for dte poly-His tagged proteins. Tlte homogeneity was assessed by SDS 
polyac-ylamide gels and by N-tcrminal amino acid sequencing by Edman degradation 

PR0211. PR021-. PRO230, PR0232, PR0IJ7. PR0265, PR0219. PR0246, PR0228. PR0533 PR0245 
PR022.. PR0227, PRO220. PR025S, PR0266. PR0269, PR0287. PR02,4. PR0317, PRO301 ' PRO224' 
PR0222. PR0234, PRO^,. PR0229, PR0233. PR0238, PR0223. PR0235, PR0236, PR02*2 PRO239' 
PR0257, PRO260, PR0263. PRO270. PR0271. PR0272, PR0294. PR0295. PR0293, PR0247 PRO304' 
PRO302. PRO307, PRO303. PR0343. PR0328. PR0326, PR033.. PR0332. PR0334, PR0346' 
PRO330. PRO3,0 and PR0339 were also strccessfiUly trans,cnUy expressed in COS cell, 
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EXAMPLE 55 : EXT,TU-:<r-, of PRO Pnlvp^p^^.^ y -a r 

Tb. followmg acuwd describe rccombm^, cxpr=«ioi. of . <l««d PRO pol>.p.pude in ye«:. 

Fk., yeas. cxp..=«.on vc=«,n ar. cons:rx«i for irmclldar prcducdon or st.-redon of PRO polypcpdd.s 
fror. to ADH2/GAPDH promoter. DNA encoding > dc»:r=d PRO polyp^ud=. , s.l.=«d s.pul pcpudc aad ^ 
promotzr .™ ™ surrabh r«tncnon enzyme si.cs in the selected pU.nfd to direct intr^ceUul^ «pre„ion of 
the PRO polypeptide. For secretion. DNA encodmg the PRO polypeptide can be cloned »o the selected phsmid 
together with DNA cncodbg the ADH2/GAPDH prctnoxr. the ye.t aipha-fictor secretory srgn^der seqt^nc' 
and Imker sequences Of needed) for expression of the PRO polypeptide. 

Yeas, cells, such « yeast strain ABl 10, can then be transforwd wiLh the expression plasntids described 
aoove and culmred in selected fer^manon medta. Tne transformed yeast supematanu can be analyzed by 
precipitation with 10% trichloroaoenc acid and separation by SDS-PAGE, followed by stair>ing of the jels with 
Coomassie Blue stain. 

Recontoam PRO polypeptide can sMbsetpently be isolated and prrified by removing the yeas, cells &om 
th. fermentation medium by centrifirgation and ttren concenoadng the medium nsi^ seketed caimdge lilten The 
corxentrate contaming the PRO polypeptide may f^r be pnriBed using selecud cohnnn chromatography resins. 

Tne fonowing mahod describes recombinant expression of PRO polypeptides in Baculoviius-infcccd insect 



cells. 
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:i.e desired PRO polypeptide is firsed upstream of an epitope tag contained with a baculovirus expression 
vector. Such epitope tags include poly-his tags and m^inoglobuhn tags (lilce Fc regions of IgG). A variety of 
plasmids may be employed, includir^ plasmids denved from commercially available plasmids stich as pVLI393 
(Novagen). Briefly, the PRO polypeptide or the desired portion of the PRO polypeptide (such as the set^nce 
et^coding the ex^t^ domain of , transmembrane protein, is amplified by PCR with pruners con^lemenrary to 
^.■-5 and 3 regrons. ^ne 5' primer may incorporate flanldng (selected, restriction enzyme sites. Tleproduars 
.r.=r.. digested with those selected restriction enzymes and suoeloned into the expression vccror 

Reccmbinanr baculovirus is generated by co-transfecting the above plasmid and BaculoGold™ virus DNA 
(Pharrn^., ™o .....pr..>,.g,,„^ '"'""^ ^'^^ '> (commercially available 

from GlBCO-BRl,,. After 4-5 days of incubation a, 2^0. the released vhuses are harvesred and used L fur*er 
ampliricanoas Viral in/ection and pro«in expression is performed as described by O'Rcilley « al., Baculovirus 
expression vectors: A laboratory Manual, Oxford: Oxford Universitv Pnas (1994) 

Expressed po.y-his tagged PRO polypeptide can then be purified, for example, by N,»<helate affinity 
Chroma^ as foUows. Extiac are prepared from recombinant v,rus-infec«d Sf9 cells as described by Rc;«n 

PH ■ T::sTmc^ ^ — ^" « He" , 

n t ' ^-^-^ * ^^-0^ ^ "-i-ted rwice for 20 

« on^ on ice. Ue sonicates are cleared by centrifugation. and U.e supemaran. . dilured 5C.fold in loading buffer 
(50 m.M pnosphate, 300 n.M NaO, ,0% Glycerol, pH 7.8, and mtered thrt^gh a 0.45 ^ fiUer A N^'-NTA 
agarose column (commercially available from Qiagen, is prepared with a bed vohm. of 5 mL. washed with 25 mL 
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of arj cquilibraad wi± 25 mL of loadir>g buff«. Ti. fitecd cril cx«c U loadtd ona> i,c col-jma « 0.5 mL 
pa mmua. "n-.. coiumn vashcd lo b«.te wi* loadmg bufftr. j, which point fracuon coUccdoa is suncd. 
N«:. Lh= colmru, washed wiih a SKondai)- wa.h buffer (50 mM phosphaK. 300 mM NaCl. 107. Glycerol, pH 
6.0), w^ch du,« hompccific^LUy bound protein. Ate re^cUnj A,„b«din. again, column U developed with 
a 0 to 500 nuM toida«,le g„dient in the secondary wash buffer. One mL fr^cuons are coUected and analyzed by 
5 SDS-PAGE and silve: staining or western blot with Ni=--NTA^onjugated to attaltne phosphat^e (Qiagen) 
Fracuons comaining the eluted His,..ugged PRO polypeptide are pooled and dialyzed against loading buffer 

Aliemadvely, purificadon of the IgO tagged (or Fc tagged) PRO polypeptide can be performed usmg known 
chromatography t=chniT,ies, inclndmg for instance, Protein A or protein G column chromatography 

PR021 1, PR0217. PRO230, PR0187, PR0265. PR0246, PR0228. PR0533. PR0245. PR0221 PRO220 
10 PR0258. PR0266, PR0269, PR0287, PR0214, PRO301. PR0224. PR0222, PR0234. PR0231 ' PRO229' 
PR023i, PR0239, PR0257, PR0272. PR0294. PR0295, PR0328, PR0326, PR033I. PR0334 PR0346 .ad 
PRO310 were successMy expressed in baculovirus infected Sf9 or bigh5 insect cells. While .he expression was 
actually perfom^d m a 0.5-2 L scale, it can be readily scaled up for larger (e.g. 8 L) preparations. proteins 
were expressed as an IgG consrn« Cunm«n«dhesin), in which the protein extracellular region was fused to an IgGl 
cc..^.r. regron se<p,ence confining the hmge. CH2 and CH3 domains and/or in poly-ffis tagged forms 

FoUowing PGR amplifcadon, the respective coding ,e.^es were subcto«d into a baculovirus expression 
vector (pb.PH.IgG for IgG fitsions and pb.PH.H..c for poly-His tagged proteins), and the vector and BaoJogold. 
baculovirus DNA (Pharmingcn) were co-transfected into 105 Spo^era flus>p.r^ rStS') cells (ATCC CRL 
nil), usmg UpofectL. (Gtbco BRL). pb.PH.IgG and pb,PH.H« are modifications of the commercially available 
haculovtrus express™ vector pVU393 (Pharmingen). with modified polylmker regions to include the His or Fc tag 
sequences. Uc cells were grown in Hinlc's TNM-FH OKdium supplemented wiu. lOS FBS (Hyclone) CcUs were 
.ncubated for 5 day, at 28'C. T.e s^pemaant was harvested and subse^enUy used for the firs, viral amplification 
by infectmg Sf, cells in Kink's TNM-FH medium sttpplemented wid> m, FBS at an approximate multiplicity of 
u,fecocr.(MODoflO. Cells were incubated for 3 days a. 2S-C. T.e supernatant was harvested and the expression 
of th= const^cts in the baculovtrus expression vec»r was determined by batch binding of 1 ml of supernatant to 
25 mLof Ni-NTA beads (QUGEN) for h^tidinc tagged protems or Protein-A Sepharose C^4B beads (Pharmacia, 
for IgG tagged protems foUowed by SDS-PAGE analysis comparmg to a known concentration ofprotein standard 
by Cooraasste blue siaimng. 

TV first Viral an^lificarion snpematan. was used to infect a spinner culture (500 ml) of S/9 cells grown in 
ESF.921 mcdtum (Expression Systems LLC) at an approximate MOI of 0,1. Cells were incubated for 3 day, a, 
28-C. T^e supernatant was harvested and filtered. Batch bindmg and SDS-PAGE analysis was repeated as 
necessary, until expression of the spmner culnire wu confirmed. 

The condidoned medium f™m e>c transfectcd celU (0.5 to 3 L) was harvested by cemriftigadon to remove 
*e c is and filtered through 0.22 micron filters. For d>c poly-His tagged constnscts. the protein construct were 
purtficd usmg a N.-N-TA column (Qiagen), Before purificatio.. im.da.ole was added . the condinoned media ,0 a 
concentration of 5 mM. The condinoned medra were pumped onto a 6 ml Ni-NTA column et^libr^ed in 20 mM 
Hepes, PH 7.4. buffer containmg 0.3 M NaCl and 5 mM mudazole at a flow rate of 4-5 ml/rmn „ 4-C After 
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10^. column was washed with adicc^] .cuiffDrauon bu£fc: fl>c proam clu«d with equilibranon buff„ 
conaimr^ 0.25 M imitoic. -n^ highly punned pn,«:^ wa. suhsequcnfly d«alttd imo a s«,rage buffer conaiamg 
10 mM H=p=5, 0.14 M NaCl ami 4% ■:,ara:icl, pH 6.8. wim a 25 tJ G2J Superfine (Pharmacia) colunm aBd stored 
ai -80'C. 

Imnmcadhesin (Fc comaining) consnicE of proaiiis were purified from *e condriioned rneto as foUows. 
The condmo-ed rrredia were puxped o:«o a 5 ml Proiein A column OWacia) which had been e^oilibrated ir> 20 
mM Na phospha« b-^er, pH 6.8. After loaan|. ti« column was washed exren^ively wi± equilibration buffer before 
elution with 100 mM cioic acid, pH 3.5. Tbe elured proirin was imn«diattly r^uaalized by collecting 1 ml fraedom 
nto tubes conmnng 275 mL of 1 M Tr,s buffer, pH 9. The highly purified protein was subsequently desalted imo 
storage buffer as described above for the poly-Hu tagged proteins. T>.e honK,genei,y of the protems was verified by 
SDS polyacrylamide gel (PEG) electrophoresis and N-tenmnal amino acid sequetring by Edman degradation. 

EXAMPLE 57 : Prenar^rinn Ann^ilTI Ih ii r Bird to PRO Pn]Y; T^,irtn 

This example illmtrajes preparation of monoclonaJ antibodies which can specifically bind to a PRO 

polypeptide. 

Techniques for producing the mottodonal anybodies are known in the art a«l are described, for instance 
^ G.a.ng. ajEB. tenmogens that may be employed inchrde puri6ed PRO polypeptide, firsion proteins containing 
the PRO polypeptide, and cells expressmg recombinant PRO polypeptide on the cell surface. Selection of the 
immucogen can be made by the sldUed artisan without undue eiperimemaiion. 

Mice, such as Balb/c, are immunised with the PRO polypeptide immunogen emulsified in complete Freund s 
adjuvant and injected subcutaneously or intrapcritoneally in an amount from I-lOO micrograms. Alternatively the 
murwtogen ,s enmlsified in MPHDM adjuvant (Ribi Immmrochemical Research, Hamilton, MT) and injected imo 
the animal s hind foe, pads. Tie immunised mice are then boosted 10 to 12 days later wtth addmonal inmiunogen 
emulsified in the selected adjuvant. Iltereafler, for several weeks, the mice may also be boosted with additional 
mrmumzauoD injections, Serum samples may be periodically obtained ftom the mice by ret««,rbital bleeding for 
2d testing in EUSA assays to detect ami-PRO polypeptide antibodies. 

After a suitable antibody trter has been detected, the animals -posidve" for antibodies can be injected with 
a rmal muaventms iniecnon of PRO polj^tide. Three to four days Uter. the ntice are sacrificed and the spleen cells 
are harvested. The spleen cells are then fttsed (using 35% polyethylene glycol) to a selected murine myeloma ccU 
hnc such as P3X63AgU.l. available from ATCC. No. CRL 1597. T^e fitsions generate hybridoma cells which can 
then be plated m 96 well tissue ndnrre plate, containing HAT (hypo^nthine, aminopterin. and thymidine) medimn 
to mhibn proliferation of non-fused cells, myeloma hybrids, and spleen ceU hybrids. 

The hybridoma cells will be screened in an EUSA for reactivity against the PRO polypeptide 
Deiermt^anonof -posidve- hybridoma cells secreting the desired monoclonal antibodies agairm tbe PRO polypeptide 
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is within the skill in the an. 



Tbc pcsmve hybridoma cells can be irgected inrntpentoneally into syngeneic Balb/c mice to produce ascites 
co.^ the and-PRO polypeptide mot^ional annbtxlies. Ahemadvely. the hybridoma celU can be grown in tissue 
culmre flasks or roller bonies. Purification of the monoclonaJ antibodies produced in the ascites can be accomplished 
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using ammomm sulfate precipitauon, followed by gel exclusion chromatography. Alicraanvcly. afiory 
chromatography based upon binding of amibody lo proiem A or protein G can 'oc employed. 

EXAMPLE Ch.Tnrnr PRO pnlyrTHlr^ 

PRO polypeptides may be expressed as chL-ncnc proteins wim one or more additional poiypepade domains 
added to facilitate protein purificahon. Such purificaiicn facilitaang domms include, but are not limited to. metal 
chelating peptides such a5 histidine-tryptophan modules that allow purification on immobilized mcah. protein A 
domains that aUow purification on immobilized immunoglobulin, and the domain utilized in the FLAGS™ 
extension/afnniry purification system (Wnex Corp., Seanle Wash.). -Rie inclusion of a cleavable linker sequence 
such as Factor XA or enterokina^e anvitrogcn. San Diego Calif.) between the purification domain and the PRO 
polypeptide sequence may be useful to faciUtate expression of DNA encoding the PRO polypeptide. 

EXAMPLE 59 : Purification of PRO PnWpnr|.,, n . jn^ Snedfi. Annhrv^j . ^ 

Native or rccombmanr PRO polypeptides may be purified by a vanety of standard techniques in the an of 
protein purification. For example. pro-PRO polypeptide, mature PRO polypeptide, or prc-PRO polypeptide is 
purified by immunoaffinity chromatography usmg antibodies specific for the PRO polypeptide of interest In general, 
an immunoaffinity column is constructed by covalcmly couphng the anti-PRO polypeptide antibody to an activated 
chromatographic resin. 

Polyclonal immunogbbulins are prepared from immune sera either by precipitation with ammonium sulfate 
or by purification on irmnobilized Protein A (Pharmacia UCB Biotechnology. Piscaiaway. N.J.). Likewise, 
monoclonal antibodies are prepared from mouse ascites fhiid by ammomum sulfate precipitation or chromatogr^hy 
on immo-Diiized Protein A. Partially purified immunoglobulin is covalenily attached to a chromatographic resin such 
as CnBr-acnvated SEPHAROSE- (Pharmacia LKB Biotechnology). Tlie antibody is coupled to the resin, the lesin 
is blocked, and the derivative resin is washed according to the manufacmrer's instructions. 

Such an immunoaffinity column is udiized in the purification of PRO polypeptide by preparing a fraction 
from cells ccnraining PRO polypeptide in a soluble form. This preparation is derived by solubilization of the whole 
cell or of a subcellular fraction obtained via differential centrifugation by the addition of detergent or by other 
meth^ods well know.: in the an. AJtemaavel>-. soluble PRO polypeptide containing a signal sequence may be secreted 
ui useful quantity into the medhmi in which the cells are grown. 

A soluble PRO polypcpdde<oniaining preparation is passed over the immunoaffimty cohinm, and the 
column is washed under conditions that allow the preferential absorbance of PRO polypeptide (..^.. high ionic 
strength buffers in the presence of detergent), 'n.en. the column is eluted under conditions that dismpt antibody/PRO 
pol>Tcpride bindmg a low pH bi^er such as approxinutely pH 2-3. or a Itigh ;oncen^ation of a chao^ope such 
as urea or thiocyanate ion), and PRO polypeptide is coUectcd. 

EXAMPI,F6n : Drug Screeninp 

This mvendon is particularly useful for screening compounds by using PRO polypeptides or binding 
fragment thereof in any of a vanety of dnig screenmg techniques. Tt. PRO poln-epude or fragment employed in 
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such a lesrmay either be free m sohinon. afixed to ^ soUd suppon. bome on a cell surface, or located inaacellularly. 
One method of drjg screening uniizes cukaryouc or prokaryonc host ceUs which are stably cransfonned mih 
recombiran: nuclc:c acids expressing me PRO polypeptide or fragmen:. Drugs are screened against such uamformed 
cells in competitive bmcmg assays. Such cells. e:Lhe: in viable or f:«d form, can be used for standard bindmg 
assays, One.nuy measure, for exanyle. ±e formauon of complexes between PRO polypeptide or a fragment and the 
agent beuig rested. Akematively. one can examine ±c dimimmon in complex formahon between the PRO polypepdd: 
and its target ceU or target receptors caused by the agent being tested. 

-niQS. the present invention provides methods of screening for drugs or any other agents which can affect 
a PRO poiypepiide-assDciated disease or disorder. Ttesc methods compose contacting such an agent with an PRO 
polypeptide or fragment thereof and assaymg (T) for the presence of a complex between the agent and the PRO 
polypeptide or fragmen:, or Oi) for ibe presence of a complex between the PRO polypeptide or fragment and the ceU. 
by methods well known in the ait. In such competidve binding assays, the PRO poljpcpdde or fragment is typically 
labeled. After suitable incubation, free PRO polypeptide or fragment is separated from that present in bound form, 
and the amount of free or uncomplexcd label is a measure of the ability of the particular agent to bind to PRO 
polypeptide or to interfere with the PRO polypeptidc/ceU complex. 

Another technique for drug screening provides high throughput screening for compounds having suitable 
binding affinity to a polypeptide and is described in detail in WO 84/03564. pubUsbed on September 13. 1984. 
Briefly stated, large nonbers of different smaU peptide test compounds arc synthesized on a solid substrate. «ich as 
plastic pins or some other surface. As applied to a PRO polypeptide, the peptide test compounds are reacted with 
PRO polypeptide and washed. Bound PRO polypeptide is detected by methods weU kcown in the art. Purified PRO 
polypeptide can also be coated directly onto plates for use in the aforementioned drug screening techniques. In 
addition, non-neutralizing antibodies can be used to capture the peptide and immobilize it on the soUd suppon. 

Hus invention also contemplates the use of competitive drug screening assays in which neutralizing 
araibodies capable of binding PRO polypeptide specifically compete with a test confound for bindmg to PRO 
polypeptide or fragments thereof. In this manner, the antibodies can be used to detect the presence of any peptide 
whi:h shares one or more antigenic determinants with PRO polypeptide. 

EXA MPLE Rational Dnip T^^\ ^ 

The goal of rational drug design is to produce smictmal analogs of biologically active polypeptide of interest 
(i.e. , a PRO polypeptide) or of smaU molecules with which they interact, e.g. . agonists, antagonists, or inhibitors 
A^ of these examples can be used to fashion drugs which are more active or stable forms of the PRO polypeptide 
or which enhance or mterferc with the function of the PRO polypeptide ,n wvo (c./. Hodgson, Bion-ecbnnlop. 5. 
19-21 (1991)). 

In one approach, the three-<iimensianal structure of the PRO polypeptide, or of an PRO polypeptide-inhibitor 
compbx, is determined by x-ray crystallography, by computer modeling or. most typically, by a combination of the 
two approaches. Both the shape and 0.arges of the PRO polypeptide must be ascertained to elucidate the strucmre 
and to determine active site(s) of the molecule. Less often, useful information regarding the smicmre of the PRO 
polypeptide may be gamed by modeling based on the stnicmre of homologous proteins. In both cases relevant 
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sTT^mrJ uiJonr^on u used lo d«ign araiogous PRO polypepddc-lU.- molccuics or w idcnujy efficitni bhibitors. 
U«fJ fumplcs of r^ioial dmg d«:p, nay mclud: moitcul« which hjvt boroved .cnvio- or subility « sbowr 
by Bra«on and WcUs. aaiaMia. il:779^780! (1992) or wtoch ac: a. u±Mw:,. ago„,su. or ania|onis« of 
naove ptptidss as shown by Athauda « ct.. J. Bio:h--i llj:7«-746 (1993) 

I. .. also possible .0 ..olatc a =rB=,.sp.dn= anubody. s«l«t=d by furKnoaal assay, as described above and 
5 the. ,0 solve its crystaJ structure. This approach, m pnnctple. yields a pharraaccre upon which subsequcm drug 
design can be based. I, is possible to bypass pro«m crystallography altogether by generating atta-idtorvnic artdbodtes 
tao-ids) to . fcncrional. pharmacologically active antibody. As a tnirror image of a mirror image the bir>dii« site 
of the ann-tds would be expected to be an analog of the onginal receptor. The anti-id could dten be used to identifv 
and ^oUte peptides from banlcs of chemically or btologtcally produced peptides. The isolated peptides would then 
10 act as ihc pharmacore . 

By virtue of the present mvention. sufHcient amo.m.. of the PRO polypepdde may be made available to 
penorm such ,r.al>tical satdies as X-ray crystallography. In addidon. taowledge of the PRO polypeptide amino acid 
set^^e provided herein will provide guidance to Urose etnployirg computer tnodelir^ techniques in place of or in 

addition lo x-ray ciystaliography. 



15 



20 



25 



30 



Pat^cular and.PR03n polypeptide amibodies are useful for the diagnosis of prepathologic condiuons ar>d 
chronic or acute diseases such as gynecological diseases or ischemic disea.es which are charactered by differences 
m the amount or distribution of PR03I7. PR03n has been found to be e;tpressed it, human kidney and is thus liiely 
to be associated w.m abnotmahdes or patholog.es which affect this org«,. Further. sit«e it is so closely reUted ,o 
EBAF.l, tt ts likely to affect the endometritm, and od^er genital tissues. Further, due ,o hbrary sources of certain 
ESTs, i, appears that PR03I7 may be involved as well in forming blood vessel, and hence to be a tncduteo, of 
angiogcncsis. 

Dngnostic tests for PR03n include methods uuhzing 4c amibody and a label m detect PR0317 in human 
body fiutds. dssues, or extracts of such dssues. The polypeptide and antibodies of the present invention may be used 
witn or wtthou, modificadon. Frequeody, the polypeptide ^ antibodies will be labeled by joi^g them, either 
covalenuy or notKovalently. wtm a substance which provides for , detectable signal. A wide variety of label, and 
conjugation techniques are iotown and have been reported extensively in bod, die scientific and patcm literan^e 
Suitable label, inchide radiooichdcs. enzymes, substrates, cofactors. inhibitors, fluorescent agems, cbemiluminescent 
agents, magneuc panicles, and d,e like. Patents teaching the u.e of such labels uiclude U.S. Pa. No. 3 817 837- 
3.8J0.752; 3,939.350: 3.996.345. 4^.437; 4.275.149; and 4.366.241. Also, recombinant inunt^giobuhn.' may 
oc produced as shown in U.S. Pat. No, 4,815.567. 

A variety of protocols for measuring sohtble or membrane-bound PRo3I7. using either polyclonal or 
monoclonal antibodies specific for that PR0317. are known in the an. Exantple. it^lude enzyme-linked 
—orbeni assay (EUSA), radiounmunoassay (RU). radioreceptor assay (RRA). and fluorescent activated ceU 
son,ng (FACS). A two-site monoclotul-based immunoassay utilizing monoclonal antibodies reactive to mo 
non-mterfermg epitopes on PR0317 is preferred, but a competitive binding assay may be employed. These as^ys 
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arc dcscTiKd, among other places, in Maddox ei al J Exp. M?f^ JiS:I2n (1983). 

EXAMPLE 63 : Idenafxsnon of PR0117 Brr-prr^n 

Puriiiw: PR0317 is usiful for charactcnzadon and purificarion of specific ccU surface rrcfpwrs and o±ti 
binding molecules. Cells which respond lo PR03I7 by metaboiic changes or other specific responses arc likely to 
express a receptor for PR03n. Such receptors include, but are not limited to. receptors associated with and 
activated by tyrosine and sennc/thrconinc kinases. See Kolodzicjczyk and Hall, supra, for a review on known 
recepMrs for the TGF- superfamily. Candidate receptors for this superfamiiy fell imo two primary groups, termed 
t>'pe I and type II recepion. Both types arc scnne/thrtoninc kinases. Upon activation by the appropriate bgand. t>'pc 
1 and i>'pc D receptors physically interact lo form hetero-oUgomcrs and subsequently aoivaie intracellular signaling 
cascades, ulmnaiciy regulamg gene transcription and expression. In addition, TGF- binds to a diird receptor class, 
type m. a membrane-anchored proteoglycan lacking the kinase activity typical of signal transducing molecules. 

PR0317 receptors or other PR0317-binding molecules may be identified by interaction with radiolabeled 
PR0317. Radioactive labels may be incorporated into PR0317 by various methods known in the an. A preferred 
embodimsm is the labelmg of primary amino groups in PR0317 with '"I Bolton-Huntcr rcageoi (Bolton and Hunter, 
B i Q^^^g"'' J" f 1^3)). which has been used to label other polypeptides without concomitant loss of biological 

activity (Hcbcrt ei al., J. PM . Cbtm ,. 2fifi:18989 (1991); UzCoW ex al.. J. Immunol. . m45-5CM555 (1993)). 
Receptor-beanng ceils arc ircubatcd with labeled FR0317. The cells arc then washed to removed unbound PR0317. 
and receptor-bound PR0317 is quantified. The data obtained using different concentrations of PR0317 are used to 
calculate values for the number and affinity of receptor?. 

Labeled PR03I7 is mcfui as a reagent for purificanon of its specific receptor. In one embodiment of affinity 
pur-Jicanon, PR0317 is covalcntly coupled to a chromatography column. Receptor-bearing cells are extracted, and 
the extract is passed over the column. The receptor binds to the column by virtue of its biological affinity for 
PR03 17. Tnc receptor is recovered from the column and subjected to N-terminal protein scqucncmg. This amino add 
sequence is then us=d to design degenerate oligonucleotide probes for cloning the receptor gene. 

In zn alternative method. mRNA is obtained from receptor-bearing cells and made into a cDNA library. The 
Ubraiy is tran^fccted mto a population of cells, and diosc cells expressing the receptor are selected using fhiorescenUy 
labeled PR0317. Tte receptor is identified by recovering and sequencing recombinant DNA from highly Ubclcd 
ceiis. 

In another alternative method, antibodies arc raised against the surface of receptor bearing cells, specifically 
monoclonal antibodies. The monocbnal antibodies arc screened to idcntily those which inhibit the binding of labeled 
PR03 17 Tncse monoclonal antibodies are then used m affinity purification or expression cloning of tile receptor. 

Soluble receptors or other sohible bindmg molecules are identified in a similar manner. Ubeled PR0317 
is incubated with extracts or other appropriate materials derived from the uicrus. After incubation. PR03I7 
complexes larger than the size of purified PR0317 are identified by a sizing technique such as size-exclusion 
chromatography or density gradient centrifugation and are purified by methods known in the an. The sohible 
receptors or bmdmg protem(s) arc subjected to N-tcrminal sequencing to obtain information sufficient for database 
identification, if the soluble protein is known, or for cloning, if the sohible protein is unknown. 
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EXAMPLE 64 : Deierminanrn nf PRn^i7.Tn >i.,.. ^ cdlular R.^nnn.- 

Ttr biologicai acaviry of PR03I7 is ir.easurtd. for example, by binding of an PR0317 of the inveniion to 
an PR0317 recrptor. A i«t compound is screened as an antagonist for its abUiry to block binding of PR03 17 lo the 
receptor. A test compcnmd is screened as an agonist of ihe PR0317 for its abUity to bmd an PR0317 receptor and 
mflu=ix:e Lhi^ainc physiological events as PR03I7 using, for example, the KIRA-EUSA assay described by Sadick 
ei g/.. Ana l VU« l BiOCh?mjm 23^:207-214 (1996) in which activation of a receptor tyrosine kinase is monitored 
by immuno-captuic of the activated receptor and quanhiadon of the level of hgand-induced phosphorylation. -Die 
assay may be adapted to monitor PROBH-induced receptor activation through the use of an PR0317 reccptor-specific 
andbody to capture the acuvaied receptor. -Dicsc techniques are also applicable to other PRO polypeptides described 



herein. 



EXAMPIJt 65 : Use of PRn?.?4 for S.rrr^r(^j^^ '^"W r m rll 

PR0224 is expressed in a ceU stripped of membrane proteins and capable of expressing PR0224. Low 
density lipoproteins having a detectable label are added to the cells and incubated for a sufficient time for endocytosis. 
Tne celk arc washed. -Rie cells arr then analysed for Ubel bound to the membrane and within the cell after cell lysis. 
Detection of the low density lipoprotems within the cell determines that PR0224 is within the family of low density 
lipoprotein receptor proteins. Members found within this family are then used for screening compot:^ which affect 
these receptors, and panicularly the uptake of cholesterol via these receptors. 

£XAM£L£-^: Ability ?F0 Pplvp.nitict'; to Inhabit V.^mhr Rnrfnrh.li.l n..^n Factnr rVFOF. ^^m \^r^ 
Proliferation of Endotheli al Cell Grn^\ ^ 

-nie ability of various PRO polypeptides to inhibit VEGF stimulated proliferation of endothelial ceUs was 
tested. Specifically, bovine adrenal conical capillary endothelial (ACE) cells (from primary culture, maximum 12-14 
passages) were plated on 96-wen microtiier plates (Amersham life Science) at a density of 500 cells/weU per 100 
.L m low glucose DMEM. 10% calf serum. 2 mM glu^e. Ix pen/strept and ftmgizone. supplemented with 3 
ng/mL VEGF, Controls were plated the same way but some did not include VEGF. A test sample of the PRO 
polypeptide of mterest was added in a 100 ^1 volume for a 200 m1 final volume. Cells were incubated for ^7 days 
at 37=C. The media was aspirated and the cells, washed Ix with PBS. An acid phosphatase reaction mixmre (100 
.L,0.1Msodiumacetate.pH5.5,0.1%Triton-100. 10 ^ p-nitrophenyi phosphate) was added. After incubation 
for 2 hours at 37'C, the reaoion was stopped by addition of 10 ^1 IN NaOH. CD was measured on microtiter plate 
reader at 405 nm. Controls were no celh. cells alone, cells + FGF (5 ng/mL). ceUs + VEGF (3 ng/mL) ccUs + 
VEGF (3 ng/ml) + TGF-P (1 ng/ml). and cells + VEGF (3og/mL) + UF (5 ng/mL). aCF-P at a 1 ngAnl 
concentranon is known to block 70-90% of VEGF stimulated ceU proliferation.) 

The results were assessed by calculating the percemage inhibition of VEGF (3 ng/ml) stimulated cells 
prohfcrauon, deterrmned by measurmg acid phosphatase activity at OD405 nm. (I) relative to ceUs without 
snmdation. and H) relative to the reference TGF-P inhibition of VEGF stimulated activity. T^e result as shown 
ui Table 2 below, arc mdxcanve of the utihry of .he PRO polypeptides in cancer therapy and specific^y in inhibiting 
angiogenesis. TTie numerical values (relative inhibition) shown in Table 2 are determined by calculating the 
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pfrcsn inhioirioo of VEGF stimkied prolifcrarion by the PRO polypepiid; rslaiive to cells without siiimilztion and 
then dividing ibar perccnagc into the percent inhibiiicn obtained by TGF-p at 1 ng/m] vhich is known to block 70- 
90 o: VEGF sumula'^d cell proliferation. 
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PRO Name 


PRO Conccijffqtigtij 


Relative Inhib 


PR0211 


0.01% 


99.0 


PR0211 


0.01% 


1.09 


PR0211 


0.1% 


0.95 


PR0211 


0.1% 


67.0 


PR02n 


1.0% 


0.27 


PR 02 11 


1.0% 


20.0 


PRa217 


0.01% 


1.06 


PR0217 


Q.1% 


0.84 


PR0217 


1.0% 


0.39 


PR0217 


2.5 fiM 


0.2 


PR0217 


25 nM 


0.88 


PR0217 


250 nM 


0.58 


PR0187 


0.01% 


0.91 


.RC1S7 


0.1% 


0 82 


PP0187 


1.0% 


0.44 


PR0219 


5.7 


0.61 


PR0219 


57 nM 


1.09 


PR0219 


570 nM 


0 97 


PR0246 


0.01% 


1.04 


PR0246 


0.1% 


l.O 


PR0246 


1.0% 


0.49 


PR0228 


0.01% 


0.99 


PR0228 


0.1% 




PR0228 


1.0% 


0 57 


PR0228 


0.01% 


0.95 


PR0228 


0.01 % 


0.98 


PR0228 


0.1% 


0.77 


PR022S 


0.1% 


0 8R 


PR0228 


1.0% 




PR0228 


1.0% 


0 4!^ 


PR0245 


0.01% 


0 7fi 


PR0245 


0.1% 


0 35 


PR0245 


1.0% 


0 1 1 


PR0245 


0.48 nM 




PR0245 


4.8 nM 


0.95 


PR0245 


48 nM 


0.49 


PR0221 


0.01% 


1.03 


PR0221 


0.01% 


1.06 


PR0221 


0.1% 


0.82 


PR0221 


0.1% 


0.93 


PR0221 


1.0% 


0.31 


PR022: 


1.0% 


0.43 


PR0258 


0.01% 


0.98 


PR0258 


0.01% 


1.06 


PR0258 


0.1% 


0.95 


PR0258 


0.1% 


1.02 


PR0258 


1.0% 


0.6 


PR0258 


1.0% 


0.69 
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Tablg 2 com' 



PRO NiT" 


PRO Coticcntrarion 


Rrla"V' Inhibit 


PRO301 


7.0 /iM 


1.02 


PR03Q1 


70 iiM 


0.88 




700 


0 44 




0 0] 




FRO30] 


O.l % 


0.85 


PR03Q1 


1.0% 


0.68 




0.01% 






0.1 % 




DO 0124. 


1.0 7e 




PR 027^ 


0 01 % 






0,1 % 




PR0272 


1.0% 


0.18 


PR0328 


0.01% 


0-98 


PR0328 


0.1% 


0.96 


PR0328 


1.0% 


0.6 


PR033I 


0.01% 


0.88 


PR0331 


0.1% 


0.82 


PR033! 


1.0% 


0.56 
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15 



20 

F-X AMPLE 67 : RfflPft] NgUfPTl Survival 

Tnis example dcmonsn^tes ihai PRO220 polypeptides have efficacy in enhancing the survival of retinal 
neuron cells. 

25 Spiaguc Dawlcy rai pups at posmatai day 7 (mixed popuiatioD: glia and retinal neuronal types) are killed 

by decapitation following COj anesthesia and the eyes are removed under sterile conditions. The neural retina is 
dissected away from the pigment epithelium and other ocular tissue and then dissociated into a single cell suspension 
using 0.25% trypsin in Ca'% Mg^*-frce PBS. The rttinas arc incubated at 37"C for 7-10 minutes after which the 
trypsin is mactivatcd b>- adding 1 ml soybean trypsin inhihitor. The cells are plaied at 100,000 ceUs per well in 96 

30 v/cD plates in DMEM/F12 supplemrmed with N2 and with or without the specific test PRO polypeptide. Cells for 
all cxp:nmcnis are grown at 37°C in a water saturated annosphcre of 5% COj. After 2-3 days in culture, cells arc 
staiEcd with calccin AM then fixed using 4 % paraformaldehyde and stained with DAPI for detenninaiion of total cell 
ccun:. The total cells (fluorescent) are quantified at 20X objective magnification using CCD camera and NIH image 
software for Macintosh. FicMs in ihe well arc chosen at random. 

3 5 Th= effect of various concentration of PRO220 polypq)tides are rcporwd in Table 3 below where percent 

surv ival is calculated by dividing the total number of calccin AM positive cells at 2-3 days in culture by the total 
number of D API-labeled cells at 2-3 days in culture. Anything above 30% survival is considtrtd positive. 

Table 3 

40 FRO N^e PRO ConcentrafTtjn Percent Survival 

PRO220 0.01% 2.4% 

PRO220 0.01% 4.1% 

PRO220 0.1% 3.0% 

PKO220 0.1% 3.1% 

45 PRO220 1.0% 72.4% 

PRO220 1.0% 42.1% 
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EXAMPLE 6B - Rod Phcrrr^^cecigr Survival 

Tbs exaraple demonsc-atcs tha: PRO220 polypcpiidcs have efficacy m enhancing the suivivaj of rod 
photoreceptor cells. 

Sprague Dawie>' ra: pups ai 7 cay posmatal (mbted population: glia and rednal neuronal cell rypes) are killed 
by cecapi-jucn foUowing CO, anesthesis and the eyes are removed under sterUe conditions. The neural retma is 
dissected away foim the pigment epiihehmn and other ocular tissue and then dissociated imo a single cell suspension 
using 0.25 % trypsin in Ca'", Mg'*-free PBS. The retinas are incubated at 37'C for 7*10 minutes after which the 
trypsin is macuvaicc by adding 1 ml soybean trypsin inhibitor. The celk are plated at 100.000 cells per well m 96 
wcU plates in DMEM/FI2 supplemented with N2 and with or without the specific test PRO polypeptide. Cells for 
ail expennmis arc grown at 37' C in a water samrated atmosphere of 5 % C02. After 2-3 days in culture, ccUs art 
fixed using 4fc paraformaldehyde, and then stained using CeUTracker Green CMFDA. Rho 4D2 (ascites or IgG 
1:100), a monoclonal antibody direoed towards the visua! pigment rhodopsin is used to detect rod photoreceptor cells 
by indirect hrnminofluoresccnce. The results are reported as % survival: total number of caiccin/CcUTracker - 
rhodopsin positive cells at 2-3 days.in cuinire, divided by the total number of rhodopsin positive cells at time 2-3 days 
L-i ciJiure. The total cells (fluorescent) arc quantified at 20x objective magnificaiion using a CCD camera and NIH 
image sofr^'are for Macintosh. Fields in the well arc chosen at random. 

The effect of vanous concemration of PRO220 polypeptides are reported in Table 4 below. 'Anything above 
10% survival is considered positive.. 



Tabic 4 

PRO Nam? PRC Con;rniriirpn Percent Survival 

PRO220 0.01% 0 0% 

PR^220 0.1% 0 0% 

PRO220 2.0% 0^0% 

PRO220 10% 0 0% 

PRO220 20% 66.9% 

P^^O220 1.0% 56.9% 

eXAMPLE 69 : Induction of Endotheli;^! r^)] j^-^ r ^^ ^ ^]^ 

The ability of PR0228 polypeptides to induce apoptosis in endothelial ceils was tested in human venous 
umbiiicaJ vein endothelial ceils (HUVEC, Cell Systems), using a 96-weU fonnat. in 0% serum media si^lemcnied 
with 100 ng/mi VEGF. (As HUVEC cells are easily dislodged from the plating surface, ail pipetting in the wells 
must be done as gently as practicable.) 

The media was aspired and the cells washed once with PBS. 5 ml of 1 x trypsin was added to the cells in 
a T-I75 flask, and the cells were allowed to stand unnl they were released from the plate (about 5-10 mimics). 
Trypsmizaiion u-as stopped by adding 5 ml of growth media. The cells were spun at 1000 rpm for 5 minutes at 4'C. 
The media was aspirated and the cells were resuspended m 10 ml of 10% serum complemented medium (Cell 
Systems), 1 x penn/strep. 

The cells were plated on 96-wcU microtite: plates (Amcrsham Life Science. cytostar-T scintillating 
microplate. RPNQ160, sterile, tissue-culture treated, individually wrapped), in 10% serum (CSG-medium, CeU 
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Syscnarj). at a der^iiy cf 2 x 10' ceUs pe: well in a totai volume of 100 ^. The PR0228 polypcptjde was added m 
triplicaie at dibnons ofl 7o . 0 33 % and 0.1 1 % . Wells wi'J^oui ceUs were used as a blank and wells wiOi cells only 
as a neganvc control. Aj a posinve control 1:3 serial dilutions of 50 ^L\ of a 3x stock of stzurosponne were used. 
The abiliiy of \hz PR0228 Fol>'pepQdc to induce apopiosis was dcitnnined using Anncxin V. a member of die calcium 
and phospholipid binding proteins, to deiecr apoptosis. 
5 0.2 ml Acnexm V - Bictiij stock solution (100 fig/ml) were diluted in 4.6 ml 2 x Ca" binding buffer and 

2.5% BSA (1:25 diludon). 50 ^ils of the diluted Annexin V - Biodn solunon were added to each well (except 
controk) to a final concemrauon of 1 .0 ;ig/ml. The samples were incubaied for 10-15 minmes with Annexin-Biotin 
pnor to dtreci addition of '=S-Streptavidin. "S-Sirepiavidia was diluted in 2x Ca'* binding buffer, 2.5% BSA and 
was added to alJ weUs at a final concemraiian of 3 x 10' cpmAvell. The plates were then scaled, cenmfiiged at 1000 
10 rpm for 15 minutes and placed on orbital shaker for 2 hours. The analysis was performed on 1450 Microbcta Tritux 
CWallac). Thz results are shown in Tabic 5 beiow where perceni above background represents the percentage amount 
of counts per minute above the negative controls. Perccnts greater than or equal lo 30% above background are 
cocjsidrrcd positive. 



15 Ta^jk 2 

^aP^? PRO Con^Cr . Traiio j Percent Above Raclcprnimri 

PR0228 0.11% 

PR0228 0.11% 47 6% 

PR0228 0.33% 92 2% 

20 PR0228 0.33% 123 7% 

PR0228 1.0 % 514% 

PR0228 1.0% 95 3 

EXAMPLE 70 : PDB12 Ce]] Trhihmnn 

25 This example dcmonstratts thai various PRO polypeptides have efficacy in inhibiting protein production by 

PDBI2 pancreatic ductal cells. 

PDBI2 pancreanc ductal cells are plated on fibronectin coated 96 well plates at 1 .5x10^ cells per well in 100 
mL/1 80 of growth media. 100 of growth media with the PRO polypcpdde test sample or negative control 
lackmg the PRO polypeptide is then added to weU. for a final volume of 200 (iL. Controls contain growth medium 
coi:aining a proxin shown to be inactive in diis assay. Cells are incubated for 4 days at 37^. 20 ^ of Alamar Bhie 
Dye (AB) is then added to each well and the floureicent reading is measured at 4 hours post addition of AB, on a 
microtiter plate reader at 530 nm excitation and 590 nm emission. The standard employed is cclb without Bovine 
Pituitary Extract fBPE) and with various concentrations of BPE. Buffer or CM controls from unbK)wns arc run 2 
limes on each 96 well plate. 

Tne results from these assays are shown in Table 6 below wherein percent decrease in protein production 
is calculated by con?,anng the Aiamar Blue Dye calculated protein concentration produced by the PRO polypeptide- 
treated cells with the Alamar Blue Dye calculated protein concentration produced by the negative control cells. A 
percent decrease in protem production of greater dun or equal to 25 % as compared to the negative control cells is 
considered positive. 



30 
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Table 6 



PRO 


FRO Con;?inraRt;n 


Pcrcrm Dtt^r^z m P:otti;i 


PR0211 


0.1% 


0.0% 


PR02U 


0.01% 


0.6% 


PR0211 


1.0% 


59.7% 


5 PR0287 


2.0% 


22.3% 


PR0287 


10% 


18^% 


PR0287 


50% 


67^% 


PR0287 


2.0% 


45^3% 


PR0287 


10% 


57.3% 


10 PR0287 


50% 


52.24% 


PRO301 


2.0% 


0.0% 








PRO301 


50% 


65.6% 


PRQ295 


2.0% 


0.0% 


15 PR0293 


10% 


40.4% 


PR0293 


50% 


56.7% 


EXAMPLE 71 


Stimulaiion of Adult Hean HvT>enrnphv 





This assay is designfd lo measure the abiUry of various PRO polypeptides to smnulaie hypertrophy of adult 

20 hcan. 

Vcntri::ular myoc>'tes freshly isolated from adult (250g) Sprague Dawlcy rats are plated st 2000 cell/well 
in 180 (il volume. Cells are isolated and plaed on day I, the PRO poiypepude-contaimng icsi samples or growili 
mcditm only (negative control) (70 ^1 volume) is added on day 2 and the cells arc then fixed and sained on day 5. 
After staining , cell size is visualized wherein tcUs showing no growth cnhancemcni as compared lo comrol celis are 
25 given a value of 0.0. cells showing small to moderate growth cnhai^ement as compared lo control cells are given 
a value of 1 .0 and ceils sbowmg large growth enhancemcnr as compared to control cells are given a value of 2.0. 
Ar.y degree of growth enhancement as compared to the negative control cells is considered positive for the assay. 
The results are shown m Table 7 below. 



30 Tablg 7 

PRO Nam; PRO OonCCmranfTn Growth Enhancemem Score 

PR0287 20% 1.0 

PR0287 20% 1.0 

PRO30I 20% 1.0 

35 PRO301 20% 1.0 

PR0293 20% 1.0 

PR0293 20% 1.0 

PRO303 20% 1.0 

PRO303 20% 10 

40 

EXAMPLE 72: PDB12 Cell Prnlifrr.rinn 

This example demonstrates Urn various PRO polypeptides have efficacy in inducing pToUferation of PDB12 
pancreatic ductal ceUs. 

FDB12 pancreanc ductal cells arc plated on fibrontctin coated 96 well plates at 1 .5x10^ cells per well in 100 
45 tiL/180 fiL of growLh media. 100 of growth media with the PRO polypeptide test sanqile or negative control 
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lacldag PRO polypcptid: i5 ihrn added lo well, for a fir^ volume of 200 fd.. Conirols ccFnuin growth mcdiuni 
coniairjng a prottin shown to be inactive in this assay. Cells zrz inrjbated for 4 days ai 37°C. 20 ^ of Alamar Blue 
Dye (AB) is ihen added lo each well and che flourescent reading is measured at 4 hours post adiiiion of AB. on a 
microdier plaic reader ai 530 nm exwitauon and 590 nm emission. The standard employed is cells without Bovine 
Pituitary Extract (BPE) and with %-arious concentrations of BPE. Buffer or growth medhira only comrols from 
5 unknowns are run 2 times on each 96 well plate. 

The resulLs from these assays arc shown in Table 8 below wherein percent increase in protein producnon 
is calculated by comparing the Alaaiar Blue Dye calculated protein conccDtiation produced by the PRO polypcpiidt- 
treated celJs with the Aiamar Blue Dye calculated protein concentration produced by the negative control cells. A 
pcrccn: increase in pro:ein production of greater than or tquai to 25% as compared to the negative control cells is 
10 considered positive. 



Table 8 

PEjQiiame PRO Cc>nt;;naaaQn Pcrgtm Incrca;c in Proicin Proitoftn 

PRO301 2.0% 44.0% 

15 PKO301 10% 67.4% 

PRO301 50% 185.8% 

PRO303 2.0% 27.9% 

PRO303 10% J 74. 9% 

PRO303 50% 193.1% 

20 

EXAMPLE 73: Enhancement of Hean Neonatal H vpertrophv Induced bv PR0224 

This assay is designed to measure the ability of PRD224 polypeptides to stimulate hypertrophy of neonatal 

heart. 

Cardiac myoc>ics from 1-day old Harlan Sprague Dawlcy rats were obtained. Cells (180 fd at 7.5 x 10*/ml. 

25 scram <0. 1 % , &cshiy isolated) are added on day 1 to 9&-wcU plaM;s previously coaled with DMEM/F12 +4% FCS. 
Test samp-es containing the test PRQ224 polypeptide or growth medium only (hegativc control) (20 /J/ well) arc added 
d:rectJy to the wells on day 1. PGF (20 >iI/welJ) is then added on day 2 a: final concentration of ICT* M, The cells 
art then stained on day 4 and visually scored on day 5, wherein cells showing no increase in size as compared to 
negative coraols are scored 0.0, cells showing a small to moderate increase in size as compared to negative comrols 

30 are scored 1 .0 and cells showing a large increase in size as compared to negative comrols arc scored 2.0. The results 
are shown in Table 9 below. 



Table 9 

PRO ^^ a mc PBC> Cgnc^ntrflnpn Qrow^ti EnbanKiRgm Scorg 

35 PR0224 0.01% 0.0 

PR0224 0.1% 0.0 

PR0224 1.0% 1.0 

E>L^MPLE 74: In situ Hvhririi7.anpp 
40 In shu hybridizatioa is a powerful and versatile technique for the deieciion and localization of nucleic acid 

sequetKcs wiLhm cell or tissue preparaiiom. It may be useful, for e^plc, to identify sites of gene expression. 
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anaiyzc the disuc disribution cf miiscr^aon, identify and lociiiz; viral infccnon, follow changes in specific mRNA 
s>Ticnesis and aid in chiomosonis mapping. 

In siTj ii>'bndizahon was ptrfomicd following an optimized version of the protocol by Ui and GiUer. CsU 
Vision 1:169-176 (1994), using PCR-gcncrated ^'P-labtled riboprobts. Bricfi>'. formalin-fixed, paraffin-embedded 
human tissues were sectioned, dcparaffinized, dcproieinaied in proteinase K (20 g/ml) for 15 minu'^ at 37'C, and 
5 further processed for in sim hybridizanon as described by Lu and Gillec, supra. A ["-PI UTP-labeled anrisense 
nboprobc was generated from a PCR product and hybridized at 55'C overnight. The slides were dipped in Kodak 
NTB2 nuclear track emulsion and exposed for 4 weeks, 
"P-Riboprobe synthesis 

6.0 /il (125 raCi) of "P-UTP (Amcrsham BF 1002, SA<2000 Ci/mraol) were speed vac dried. To each 
10 tube containing dried '^P-UTP, the following ingredients were added: 
2.0 5x transcription buffer 
1.0 nl DTT (100 mM) 

2.0 ^l\ NTP mix a.5 mM : 10 m: each of 10 raM GTP, CTP & ATP + 10 ^1 HjO) 
1.0m1UTP(50mM) 
15 1 O ;il Rnasin 

1 .0 Ml DN A template (l^xg) 

1,0 ^1 RNA polymerase (for PCR products T3 = A5, T7 = S, usually) 

The mbes were incubated at 37°C for one hour. 1 .0 ^1 RQi DNase were added, followed by incubation 

20 at 37'C for 15 minutes, 90 fi\ TE (10 nLM Tris pH 7.6/lmM EDTA pH 8.0) were added, and the mixture was 
pipcticd onio DE81 paper. The remaining solution was loaded in a MicTOCon-50 ultiaiiltration unit, and spun using 
program 10 (6 minuxs). The filtration unit was tnvened over a second mbe and spun using program 2 (3 minutes). 
After the final recovery spin, 100 fil TE were added. 1 of the final product was pipetted on DE81 paper and 
counted in 6 ml of Biofluor IT. 

25 The probe was run on a TBE/urea gel. 1-3 mI of the probe or 5 ;J of RNA Mrk ID were added to 3 mI of 

loading buffer. After heanng on a 95 "C heat block for three minntes. the gel was immediately placed on ice. The 
wtHs of gel were flushed, the sanplc loaded, and run at 180-250 volts for 45 minutes. The gel was wrapped in saran 
wrap and exposed to XAR film with an intensifying screen in -70*'C freezer one hour to ovcmight. 
"P-Hvhridt7atinn 

30 A, Pretreatment of frozen %f.^inn^ 

The slides were removed from the freezer, placed on aluminium trays and thawed ai room temperature for 
5 mumics The trays were placed in 55'C incubator for five minutes to reduce condensation. The slides were fixed 
for 10 mmutes in 4% paraformaldehyde on ice in the fume hood, and washed in 0.5 x SSC for 5 minutes, at room 
temperature a5 mi 20 x SSC -t- 975 ml SQ H^O). After deprotcinatitm in 0.5 ^g/ml proteinase K for 10 minutes 

35 at 37-C (12.5 ^1 of 10 mg/rai stoclc in 250 ml prcwanncd RNasc-free RNAsc buffer), the sections were washed in 
0,5 X SSC for 10 minutes at room temperature. The sections were dehydrated in 70%, 95%, 100% ethanol, 2 
minutes each. 
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B Prsjcarxen: of oaraffin^m hedded ^r^rions 

Th; slices were deparafGnized. placed in SQ K;0, and rinitc twice in 2 x SSC ai room temperaaire. for 
5 rruauies each vsnc. The sections were dcproicinaied in 20 ^g/mi proteinase K (500 fA of 10 mg/ral in 250 ml 
RNasc-frct RNase buffer; 37°C. 15 nmmtes) - human embryo, or 8 x proteinase K (100 ^1 in 250 ml Rnase buffer, 
37''C, 30 rainuics) - formalin nssues. Substqucni rinsmg in 0.5 x SSC and dehydration were performed as described 
5 above. 

c. PrciJYtjriqizaupn 

Tne slides were bid out in a plastic box lined with Box buffer (4 x SSC, 50% formamide) - saniraied filter 
paper. The tissue was covered with 50 ^^l of hybridization buffer (3.75g Dexiran Sulfate + 6 ml SQ HjO), voncxcd 
and heated in the microwave for 2 mimics with the cap loosened. After cooling on ice, 18.75 ml formamide, 3.75 
10 ml 20 X SSC and 9 ml SQ HjO were added, tiie tissue was vonexed well, and inoibarcd at 42*'C for 1-4 hours. 

1.0 x 10' cpra probe and 1 .0 mI ^NA (50 mg/ml stock) per slide were heated at 95 "C for 3 minutes. The 
slides were cooled on ice. and 48 ^il hybridization buffer were added per slide. After vonexing. 50 ^ll "P mix were 
added to 50 ^1 prchybridizaiion on slide. The slides were incubated overnight at 55*C. 
15 E. Washes 

Washing was done 2x10 minutes with 2xSSC, EDTA a: room tcn^jcramre (400 ml 2ax SSC + 16 ml 
0.25M EDTA, Vf=4L}, followed by RNascA treatment at 37*C for 30 minutes (500 m1 of 10 mg/ml in 250 ml Rnase 
buffer = 20 Mg/ml). The slides were washed 2 x 10 minutes with 2 x SSC, EDTA at room temperamre. The 
stringency wash conditions were as foUows: 2 hours at SS'C. 0.1 x SSC. EDTA aO ml 20 x SSC + 16 ml EDTA. 
20 V=4L). 

F. Oligomclent^dffs 

In Sim analysis was performed on a variety of DNA sequences disclosed herein. The oligonucleotides 
employed for these analyses are as follows. 

(1) DN.A33Q9^n3TrPRn2TA 

25 p: 5 -GGATTCTAATACGACrCACTATAGGGCTCAGAAAAGCGCAACAGAGAA-3' (SEQ ID NO:348) 
p2 5 ' 'CTATGAAATTAACCCTC ACTAAAGGGATGTCTTCC ATGCCAACCTrC-3 ' (SEQ ID NO:349) 

(2) DNA33223-n3fi rPRn7.^m 

pi 5'-GGArrCTAATACGACTCACTATAGGGCGGCGATGTCCACTGGGGCTAC-3' (SEQ ID NO:350) 
30 p2 5'-CTATGAAATrAACCCrCACrAAAGGGACGAGGAAGATGGGCGGATGGT-3' (SEQ ID NO:351) 

(3j DNA34435-n40fPRm^?'t 

pi 5'-GGATTCTAATACGACTCACTATAGGGCACCCACGCGTCCGGCTGCTT-3' (SEQ ID NO:352) 

p2 5'-CTATGAAATr.\ACCCTCACTAAAGGGACGGGGGACACCACGGACCAGA-3 ' (SEQ ID NO:353) 

35 
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(4) DNA35639.I172 fPRQ246i 

pi 5'-GGATTCTAATACGACrCACTATAGGGCTTGCTGCGGTTnTGTTCCTG^3" (SEQ ID NO:354) 

p2 5'-CTATGAAArr.AACCCTCACTAAAGGGAGCTGCCGATCCCACTGGTATT-3' (SEQ ID NO:355) 

(5) I>NA^943;-1219 fPRQS33^ 

5 pi 5 -GGATTCTAATACGACTCACTATAGGGCGGATCCTGGCCGGCCTCTG-3' (SEQ ID NO:356) 

p2 5'-CTATGAAATTAACCCTCACrrAAAGGGAGCCCGGGCATGGTCTCAGTrA-3' (SEQ ID NO:357) 

(6) DNA3563S.n41 rPR0245^ 

pi 5 -GGATTCTAATACGACTCACTATAGGGCGGGAAGATGGCGAGGAGGAG-3' (SEQ ID NO;358) 

10 p2 5 -CTATGAA.i.TTAACCCTCACTAAAGGGACCAAGGCCACAAACGGAAATC-3' (SEQID NO:359) 

(7) DNA33089-1132 fP^m7U 

pi 5 '-GGATTCTAATACCACTCACTATAGGGCTGTGCTTTCATrCTGCCAGTA-3' (SEQ ID KO:360) 

p2 5 -CTATGAAATTAACCCTCACTAAAGGGAGGGTACAATTAAGGGGTGGAT-3' (SEQ ID NO:361) 

15 

(8) nN'A3591R-n74 cp^mm 

pi 5 -GGATTCTA.ATACGACTCACTATAGGGCCCGCCTCGCrrCCTGCTCCTG-3' (SEQ ID NO:362) 

p2 5 -CTATGAAATTAACCCrrCACTAAAGGGAGGATTGCCGCGACCCTCACAG-3' (SEQ ID NO:363) 

20 (9) DNA32286-n9i n>nm\4\ 

pi 5 '-GGATTCTAATACGACTCACTATAGGGCCCCTCCTGCCTTCCCTGTCC-3' (SEQ ID NO:364) 

p2 5'-CTATGAA.MTAACCCTCACT.\AAGGGAGTGGTGGCCGCGATTATCTGC-3' (SEQ ID NO:365) 

(10) DNA33221-ir,-t rp^mod^ 

25 pi 5 -GGArrCTAATACGACTCACTATAGGGCGCAGCGATGGCAGCGATGAGG-3' (SEQ ID NO;366) 

p2 5 -CTATGAAATTAACCCTCACTAAAGGGACAGACGGGGCAGAGGGAGTG-3' (SEQ ID NO:367) 

(11) DNA35557-n37 rPR0234'> 

pi 5'-GGATTCTAATACGACTCACTATAGGGCCAGGAGGCGTGAGGAGAAAC-3' (SEQ ID NO:368) 

30 p2 5'-CTATGAAATrAACCCTCACTAAAGGGAAAGACATGTCATCGGGAGTGC;-3" (SEQ ID NO:369) 

(12) DNA3310Q-n59 (PRQ229^ 

pi 5'-GGATrCTAATACGACrrCACTATAGGGCCGGGTGGAGGTGGAACAGAAA-3' (SEQ ID NO:370) 

p2 ^"-CTATGAAATTAACCCTCACTAAAGGGACACAGACAGAGCCCCATACGC-3' (SEQ ID NO:371) 

35 
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pi 5'-GGATrCTAATACGACTCACTATAGGGCCAGGGAAATCCGGATGTCTC-3' (SEQ ID NO:372) 
p2 5 -CTATGAAATrAACCCTCACTAAAGGGAGTAAGGGGATGCCACCGAGTA-3' (SEQ ID NO:373) 

(14) DNA^R2fiS-nSS fPPn-?05^ 

pi 5'-GGATr(rrAATACGA(rrCACTATAGGGCCAGCTACCCGCAGGAGGAGG-3' (SEQ ID NO;374} 
p2 5'^ATG.AAArrAACCCTCACTAAAGGGATCCCAGGTGATGAGGTCCAGA-3' (SEQ ID NO;375) 

G. Result s 

In Sim analysis was perfonned on a variety of DNA sequences disclosed berein. The results from these 
analyses arc as follows. 
CD DNA3309^-im fppmn) 

Highly distinctive expression pattern, that docs not indicate an obvious biological function. In the human 
embryo it was expressed in outer smoo'Ji muscle layer of the GI tract, respiratiry cartilage, brancbmg respiratory 
epithrLum, osteoblasts, tendons, gonad, in the optic nerve head and developing dermis. In the adult expression was 
obsei^eu in the epidermal pegs of the chimp K)nguc, the basal epithcliai/myocpi±elial cells of the prosuie and urinary 
bladder. Also expressed m the alveolar lining cells of the adult lung, mesenchymal cells juxtaposed" io erectile tissue 
in the penis and the cenrbral cortex (probably glial cells). In the kidney, expression was only seen in disease, in cells 
surrounding thyroidized renal tubules. 

Hu™ fCBl iisSUC? gJtanmcd fR12-EI^ wfrKM Placenta, umbilical cord, liver, kidney, adrenals, thyroid, 

lungs, heart, great vessels, oesophagus, stomach, small intestine, spicen. thymus, pancreas, brain, eye, spinal cord, 
body wall, pelvis and lower hmb. 

hwW h-^;r . ar . nssycs ZT ^ m i mt Kidney (normal and ead-stage). adrenal, myocaidhun, aorta, spleen, lymph node, 
gall bladder, pancreas, lung. skin, eye (inc. retina), prostate, bladder, liver (normal, cirrhotic, acute failure). 
Non-human primaie tis ?!ues eicanijn yj- 

(a) Ch i ntg Tissues : SaUvary gland, stomach, thyroid, parathyroid, skin, thymus, ovary, lymph node. 
fi^?^us MpnKev Tmv:.^: Cerebral cortex, hippocampus, cerebcUum, penis. 

C2) DNA33223-infi fPRH?^) 

Sections show an intense signal associated with arterial and venous vessels in the fetus. In aneries the signal 
appeared to be confined to smoctb-muscle/pcricytic cells, lb: signal is also seen in capillary vessels and in glomenili. 
It is not clear whether or not cndotheUal cells are expressing this mRNA. Expression is also observed in epithelial 
cells in the feral lens. Strong expression was also seen in cells within placental trophoblastic villi, ibesc cells he 
between the trophoblast and the fibroblast-like cells that express HGF - uncertain histogenesis. In the adult, ihcrt was 
no evidence of expression and the wall of the aoru and most vessels appear to be negative. However, expression was 
seen over vascular channels in the normal prostate and in the epithelium lining the gallbladder. Insurers expression 
was seen m the vessels of the soft-tissue sarcoma and a renal ceU carcinoma. In summary, this is a molecule that 
shows relatively specific vascular expression in the ferns as wefl as in some adult organs. Expression was also 
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observed m the fcLal lens and the adult gallbladder. 

Id a secondary screen, vascular expression was observed, suniiar to that observed above, seen m fetal 
b:ocks. Expression is on vascular smooch muscle, rather than endothelium. Expression also seen in smooth muscle 
cf the developing oesophagus, so as reported previously, thxs molecule u not vascular specific. Expression was 
examined m 4 lung and 4 breast carcinomas. Substantial expression was seen in vascular smooth muscle of at least 
3/4 hmg cancers and 2/4 breast cancers. In additioii. in one breast carcinoma, expression was observed in perirumoral 
stromal cells of uncenain histogenesis (possibly myofibroblasts). No cndothcUal cell expression was observed in this 



studv. 



(3) DNA34435.n4n rPRmi?) 

Strong expression in prostatic epithclhim and bladder epiibciimn. lower level of expression in bronchial 
cpiihdium. High background / low level expression seen in a mimber of sites, including among others, bone, blood, 
chondrosarcoma, adult hean and fctai iivcr. It is felt that this level of signal rcprcscnis background, partly because 
signal at this level was seen over the blood. AJ] other tissues negative. 

hmnan iW \ n s su;? :7i a m i ncd f£I2-E16 wrekO inrlnH^ - Phcenu. umbilical cord, liver, kidney, adrenals, thyroid, 
fungi, heart, great vessels, oesophagus, stomach, small intestine, 
spleen, thymus, pancreas, brain, eye, spinal cord, body wall, pelvis, testis and lower limb. 
Adu i t human tew C^aiK i nc^ j: Kidney (normal and end-stage), adrtnal, spleen, lymph node, pancreas, lung, eye One. 
rcdna). bladder, liver (normal, cirrhotic, acute failure). 
Non-human primate tissues examinftri- 
Chimp Ti'i'iuer adrenal 

HJlCSy? Mi?nKgv llmr^: Cerebral concx, hippocampus 

In a secondary screen, expression was observed in the epithelium of the prostate, the superficial layers of 
the urcthriium of the urinary bladder, the urethelhim lining ±e renal pelvis and the urethelhim of the ureter (1 out 
of 2 cxpenmenn). The urcdira of a rhesus monkey was negative; it is unclear whether this represcms a true lack of 
c^re^ion by 'he urcdira. or if it is the resuh of a failure of the probe to cross react with rhesus tissue. The findings 
in the prostate and bladder are similar to diose previously described using an isotopic detection technique. Expression 
of the mRNA for this anugen is NOT prosute epithelial specific. The amigen may serve as a useful marker for 
uretbelial derived tissues. Expression in the superficial, post-mitotic cells, of the urinary- tiact epithelium also suggest 
that it IS unlikely to repre^em a specific stem ceU marker, as this would be expected to be expressed specifically in 
basal epitheliimi. 

(4) DN'A35639-]:7? ^F^^ 

Strongly expressed in fetal vascular cndotheimm, including tissues of the CNS. Lower level of expression 
in adult vasculature, mchidmg the CNS. Not obviously expressed at higher levels in tumor vascular endothelium. 
Signal also seen over bone matrix and adult spleen, not obviously cell associated, probably related to non-specific 
background at these sites. 

Kumm fcBl n.^,^r^ c?taining^ l rFi?-F16 wc;1ta) inrll l fir : Placenta. umbiUcal cord, liver, kidney, adrenals, thyroid. 
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Imgs, bean, grtai vesscli, otsopbagus, sumiach. %mall uucsdnc, spleen, [hyir.us. pancreas, brain, eye. spinal cord, 
body wall, pelvis, testis and lower limb. 

Adult human tissues exannnc j: Kidacy (normal and end-stage), adrer^. spleen, l>'iTTph node, pancreas, lung, eye (inc. 
reuna), bladder, liver (nonnal, cirrhotic, acute faiiuie). 
N'nn-human prtmate tissues eTainined: 
5 Chimp Tissues: adrenal 

Rhesus Mgrlgv TiSSUW: Cerebral conex. hippocampus 

(5} PN-A4943 M2 i9rFR Q533) 

Moderate expression over cortical ncuioncs in the fetal brain. Expression over the inner aspect of the feial 
10 rctma, possible expression in the developing lens. Expression over fetal skin, cartilage, small intestine, placcmal villi 

and umbilical cord. In aduli tissues there is an exiremely high level of expression over the gallbladder epithelium. 

Moderate expression over the adult kidney, gastric and colonic cpiihelia. Low-level expression -was observed over 

many cell types in many tissues, this may be related to stickiness of the probe, these data should therefore be 

interpreted with a degree of caution. 
15 Hunwi fetal Tiisuss examined rH12-F.16 weekO infMr' Placenta, tmibilical cord, liver, kidney, adrenals, thyroid. 

hiDgs, heart, great vessels, oesophagus, stomach, small intestine, spleen, thymus, pancreas, brain, eye, spinal cord, 

body wall, pelvis, testis and lower limb. 

A<M\ huma.T assvH 7;^agmcd: Kidney {normal and end-stage), adrenal, spleen, lymph node, pancreas, lung, eye One. 
retina), bladder, liver (normal, cirrhotic, acute failure). 
20 Non-humar. primate tiss ues examined : 
Chimp Tissues : adrenal 

Rh,e?u^ M9nX?v Ti^sye;: Cerebral cortex., hippocampus, cerebellum. 

(6) DNA3?$3g-il4i (?R024;rf 
25 Expression obscn^ed m Lhc endothelium lining a subset of fetal and placental vessels. Endothelial expression 

wa,s confined to these tissue blocks. Expression also observed over micnnediate trophoblast cells of placenta. All other 
tissues negative. 

hctal tissues examined fHl 2-E j 6 week^'i mclune - Placenta, umbilical cord. Uvcr, kidney, adrenals, thyroid, hmgs, heart, 
great vessels, oesophagus, stomach, small intestine, spleen, thymus, pancreas, brain, eye. spinal cord, body wall, pelvis 
30 and lower limb. 

Aduii tissues examined: Liver, kidney, adrenal, myocardium, aorta, spleen, lymph node, 

pancreas, lung, skm, cerebral cortex (rm), hippocampU5{rm), cerebeUura(rm), penis, eye, bladder, stomach, gastric 
carcinoma, colon, colonic carcmoma, thyroid (chimp), parath>Toid (chimp) ovaiy (chimp) and chondrosarcQma. 
Acctommophen induced liver iniury and hepatic 
35 cirrhosis 
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(7) DNA33Qg9-T13? (VR077V^ 

Specific expression over fcul cerebral white and grey maner, as well as over neurones in the spinal cord. 
Probe appears lo cross react with ra:. Low level of expression over cerebcUar neurones in adult rhesus brain. AU 
other lissucs negative. 

P:b ! Ti?SV?? exam i ned f£l2-Et^ w^^ta) miMr - Placcnu. uinbUical cord, liver, kidnc>-. adrcaaU, thyroic. lungs. 
5 bear:, great vessels, oesophagus, stomach, small intestine, spleen, thymus, pai^crcas. brain, eye, spinal cord, body 
wall, pelvis and lower limb. 

Adult tiss'JTS gi^artntii: Uver. iQdney, adrenal, myocardium, aorta, spleen, lymph node, 

pancreas, lung, skin, cerebral cortex (rra), hippocainpus(rm), ccrcbcHum(rra), penis, eye. bladder, stomach, gastric 

carcaioma, colon, colonic caranoma and chondrosarcoma. Aceiominophcn induced liver injury and hepatic cirrhosis 

10 

(8) DNrAr';9i8-1174 fPRO?S«^ 

Strong expression in the nervous system. In the rhesus monkey brain expression is observed m cortical, 
hippocampal and cerebellar neurones. Expression over spinal neurones in the fetal spinal cord, the developing brain 
ana the inner aspens of the fetal retina. Expression over developing dorsal root and autonomic ganglia as weU as 
1 5 cmrric nerves. Expression observed over ganglion cells in the adult prostate. In the rai. there is strong expression 
over the developing hind brain and spinal cord. Strong expression over interstitial cells in the placcnol' villi. All other 
tissues were negative. 

FCH l q5?'Je; C^ajpinej fE12-R10 w(;gk?) mc)l)f1 ? r : Placenta, umbilical cord, liver, kidney, adrenals, thyroid, lungs, 
heart, great vessels, oesophagus, stomach, small intestine , spleen, thymus, pancreas, brain, eye. spinal cord, body 
20 wall, pelvis and lower limb. 

Adu l l Tissues g^t a imncd : Liver, kidney, renal cell carcinoma, adrenal, aorta, spleen, lynqih 
node, pancreas, hmg, myocardium, skin, cerebral cortex (rra). hippocampus (im). cerebcilum{nn). bladder, prostate, 
stomach, gastric carcinoma, colon, colonic carcinoma, thyroid (chimp), parathyroid (chimp) ovary (chimp) and 
chondrosarcoma. Acctominophcn induced liver mjury and hepatic cirrhosis. 

25 

(9) DNA32286-nQ1 n>^r}^^) 

Fetal tissue: Low level throughout mesenchyme. Moderate expression in placental stromal cells in 
membranous tissues and in thyroid. Low level expression in cortical neurones. Adult tissue: all neganve. 
Peta l tissv;:? gMB i ngd (? . ]2-m wgeKs) incll l dg : Placenta, umbilical cord, liver, kidney, adrenals, thyroid, hings, 
hcan. great vessels, oesophagus, stomach, small intestine, spleen, thymus, pancreas, brain, eye, spinal cord, body 
wall, pelvis and lower limb. 

AduUuss^TS gJt a m i nTd wMr Uver, kidney, adrenal, myocardium, aorta, spleen, lymph node, pancreas, lung and 

skin. 

35 (10) DNA3322Mt?l rPRmy^^ 

Expression hnuied to vascular endothelium in fetal spleen, adult spleea fetal liver, adult thyroid and adult 
lymph node (chimp). Additional site of expression is the developing 
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spinal ganglia. All otfier tissues negative. 

H'jn a n fcai l i ^SUC S CXaminrd rF , 12-£l^ u-^rk^) jn-'iii^ : Piacenii, lunb^h^al cord, livcr, kidney, adrtnah. liiyrcid. 
lungs, hcan, great vessels, oesophagus, stomach, small iniestnie. spleer. thymus, pancreas, brain, eye. spinal cord, 
body wall, pelvis and lower limb. 

Adult hniTrftP ti^SUgS g^am i nM : Kidney (normal and cod-stage), adrenal, myocardi^.:ni. aorta, splceji. lymph node, 
pancreas. lung. sJdn. eye (inc. retina), bladder. Uve: (normal. cuThotic, acute failure). 
Non-human crimate tis.?i ies examm'-rf - 

C^mg lumr - Salivary gland, stoxach, thyroid, parathyroid, skin, thymus, ovary, lymph node. 
Khc?V5 Mon^Tv TlRSWii: Cerebral cortex, hippocampus, cerebeUum. penis. 

(11) DN'A35557-I137 rPRO^^j 

Specific expression over developing motor neurones in ventral aspect of the fetal spinal cord (will develop 
into ventral horns of spinal cord). All other tissues negative. Possible role in growth, diffcrcntianon andyor 
dcvclopmcm of spinal motor neurons. 

Fetal Tissye; yxainmcd f£12-l =; 16 w(;;K^) inrlnl^ : Placenta, umbilical cord, liver, kidney, adrenals, ibyroid. hmgs. 
hcan. great vessels, oesophagus, stomach, small intestine, spleen, thymus, pancreas, brain, eye. spinal cord, body 
wall, pelvis and lower iimb. 

Adult ri55U?5 g?18mjncC : Liver, kidney, adrenal, myocardium, aoru, spleen, lynyh node, 

pancreas, hmg, skin, cerebral concx (rra). hippocampus(Tm). ccrebcllum(rm). penis, eye. bladder, stomach, gastric 

carcinoma, colon, colonic carcinoma and chondrosarcoma. Acetominopben induced Uvcr injury and hepatic cirrhosis 

(12) DNA:^^ion-nsQ rpj^m^) 

Striking expression in mononuclear phagocytes (macrophages) of fetal and adult spleen, liver, lymph node 
and adult ihymis (in tmgiblc body macrophages). The highest expression is in the spleen. All other tissues negative. 
Locaiisaaon and homology are entirely consistent with a role as a scavenger receptor for cells of the 
reticuloendothelial sysiera. Expression also observed in placental mononuclear cells. 

Kmar . tr^ ] t i sr^c; t?tain i p;^1 fF]?-n]6 w^rV^) ^^r^^^ r Placenta, umbilical cord, liver, kidney, adrenals, thyroid, 

lungs, hean. great vessels, oesophagus, stomach, small intestine, 

spleen, thymus, pancreas, brain, eye, spinal cord, body wall, pelvis and lower limb. 

AduU humm tissytfi rMm m: Kidney (normal and end-sage), adrenal, myocardhim, aorta, spleen, lymph node, 
gall bladder, pancreas, lung, skin, eye (inc. retina), prostate, bladder. Uvcr (normal, cirrhotic, acute faihire). 
Non-human nrimaTe tiaaut^^ r^m\r'fr 

Cto Timzy . Salivary gland, stomach, thyroid, parathyroid, skin, thymus, ovary, lynq)h node. 

Pire^ys Mon l tCV Tifi^lKV Cerebral cortex, hippocampus, cerebellum, penis. 

(13) DNA34431-n77n>pp2f^l) 

Wideprcad expression in human fetal tissues and placenta over mononuclear cells, probably macrophages 
+ /- lymphocytes. Tne cellular disnibuhon follows a penvascular partem in many tissues. Strong expression also seen 
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in cpj±=lial cells of ihe fetal adrenal conrx. All zdiiU tissues were negative. 

fr^v?.\ t!ssi:es examined rgI2-E16 w:cjcs^ ir-luds - PLacenii, umbilical ccrd, live:, kidney, adrenals, thyroid, lyings, 
hean, great vessels, oesophagus, stomach, small iniestinc. spleen, th>Tnus, pancreas, brain, eye, spinal cord, body 
wall. pdviE and lower limb. 

Adult tissues examined: Liver, kidney, adrenal, spleen, lymph r.ode, pancreas, lung, skin, cerebral cortex (nn). 
5 hippocampus (nn), cere'Deiliun(rni), bladder, stomach, colon and colonic carcinoma. Acetominophcn induced Uvcr 
injury and hepatic cirrhosis. 

A secondary screen evidenced expression over stromal monorojclcar cells probably hist:ocytt5. 

(14; DNA38268-n88 rPRQ295> 
10 High expression over ganglion cells in human fetal spinal ganglia and over large neuroDcs in ihe anterior horns of 

the developing spinal cord. In the aduli there is expression in the chimp adrenal medulla (neural), neurones of the 

rhesus monkey brain Ciippocanpus [+ -i- +] and cerebral cortex) and neurones in ganglia in the no rma ! adult human 

prostate (ihc only section thai contains ganglion cells, ic expression in this cell type is presumed NOT to be confined 

to the prostate). All other tiss:ues negative. 
15 Hun-tan fetal tissues examined (E12-E16 week';1 inrlnde- Placenta, umbilical cord, liver, kidney, adrenals, thyroid, 

lungs, great vessels, stomach, small intestine, spleen, thymus, pancreas, brain, eye, spinal cord, body wall, pelvis, 

testis and lower limb. 

Adult human tissues exaTmp*(^: Kidney (normal and cnd-suge), adrenal, spleen, lymph node, pancreas, lung, eye (mc, 
retina), bladder, liver (normal, cirrhotic, acute failure). 
20 Nnn-h-.iman tirimaTr tis.^nirs examined: 
Chimp Tissues: adrenal 

Rhesu!; Monkev Tiss^e^- CcTcbial cortex, hippocampus, cerebellum. 

25 The following ma'xrials have been deposited with the American Type Culture Collection, 12301 Parklawn 

Drive. Rockville, MD, USA (ATCC): 

Maltoal ATCC PgP. Nf> Deposit Date 





DNA32292'1131 


ATCC 209258 


September J 6, 1997 


30 


DNA33094^1131 


ATCC 209256 


September 16, 1997 


DKA33223-1136 


ATCC 209264 


September 16, 1997 




DNA34435-n40 


ATCC 209250 


September 16, 1997 




DNA2786M155 


ATCC 209375 


October 16. 1997 




DNA36350-1158 


ATCC 209378 


October 16. 1997 


35 


DNA32290'1164 


ATCC 209384 


October 16. 1997 


DNA35639'1172 


ATCC 209396 


October 17, 1997 




DNA33092'1202 


ATCC 209420 


Ociobci 28. 1997 




DNA49435-I219 


ATCC 209480 


November 21, 1997 




DNA35638-U41 


ATCC 209265 


September 16, 1997 


40 


DNA32298-1132 


ATCC 209257 


September 16, 1997 


DNA33089-1132 


ATCC 209262 


Sqjicmber 16, 1997 




DNA33786-1132 


ATCC 209253 


September 16. 1997 




DNA35918'1174 


ATCC 209402 


October 17, 1997 
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20 



15 



20 



25 



30 



35 



40 



45 



DNA37150-n78 

DN A3 8260-1 180 

DNA39969.:]85 

DNA32286-1191 

DNA3346M199 

DNA40628-12I6 

DNA3322M133 

DNA33107.1I35 

DNA35557-1I37 

DNA344 34-1 139 

DNA3310O-1159 

DKA3560O-1I62 

DNA34436-1238 

DNA332Q6-n65 

DNA35558-1167 

DNA35599-1168 

DKA36992-1168 

DNA34407-n69 

DNA35841-1173 

DNA33470-1175 

DNA34431-1177 

DNA39510-1181 

DNA39425-n82 

DNA4062a-1183 

DNA40604-1187 

DNA38268-1188 

DNA37151-1193 

DNA356734201 

DNA40370-12I7 

DNA42551-1217 

DNA39520-1217 

DNA41225-I2I7 

DNA43318-1217 

DKA40587-1231 

DNA4 133 8-1234 

DNA40981-1234 

DNA37 140- 1234 

DNA40982-1235 

DNA41375-I236 

DNA44 167-1243 

DNA39427-1179 

DNA40603-1232 

DNA43466-1225 

DNA43046-1225 

DNA35668-n7I 



ATCC 209401 
ATCC 209397 
ATCC 209400 
ATCC 209385 
ATCC 209367 
ATCC 209432 
ATCC 209263 
ATCC 209251 
ATCC 209255 
ATCC 209252 
ATCC 209373 
ATCC 209370 
ATCC 209523 
ATCC 209372 
ATCC 209374 
ATCC 209373 
ATCC 209382 
ATCC 209383 
ATCC 209403 
ATCC 209398 
ATCC 209399 
ATCC 209392 
ATCC 209387 
ATCC 209388 
ATCC 209394 
ATCC 209421 
ATCC 209393 
ATCC 209418 
ATCC 209485 
ATCC 209483 
ATCC 209482 
ATCC 209491 
ATCC 209481 
ATCC 209438 
ATCC 209927 
ATCC 209439 
ATCC 209489 
ATCC 209433 
ATCC 209488 
ATCC 209434 
ATCC 209395 
ATCC 209486 
ATCC 209490 
ATCC 209484 
ATCC 209371 



October H, 1997 
Oc'uObsr 17, 1997 
October P, 199'' 
October 16, 1997 
October 15. 1997 
November 7, 1997 
September 16. 1997 
September 16. 1997 
September 16. 1997 
September 16, 1997 
October 16. 1997 
October 16, 1997 
December 10, 1997 
October 16, 1997 
October 16, 1997 
October 16. 1997 
October 16, 1997 
October 16, 1997 
October 17, 1997 
October 17. 1997 
October 17, 1997 
October 17, 1997 
.October 17, 1997 
October 17. 1997 
October 17, 1997 
October 28, 1997 
October 17, 1997 
October 28, 1997 
November 21, 1997 
November 21. 1997 
November 21. 1997 
November 21, 1997 
November 21, 1997 
November 7, 1997 
June 2, 1998 
November 7, 1997 
November 21, 1997 
November 7, 1997 
NovembcT21, 1997 
November 7, 1997 
October 17. 1997 
November 21. 1997 
November 21, 1997 
November 21. 1997 
October 16, 1997 
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-niese deposit were made under the provisions of the Bud^est Treaty on the lEtermrioial Recogidlion of 
ibe Depos,: of Microorganisms for the Purpose of Patent Procedure and the Regulations thereunder {Budapest 
Treaty). This assures mamtenance of a viable culture of the deposit for 30 years from the date of deposit. Tht 
deposits wUl be made available by ATCC under the terms of the Budapest Treaty, and subject to an agreement 
between Genentech. Inc. and ATCC. which assures permanent and unresmcted availability of the progeny of the 
culture of Lhe deposit to the pubUc upon issuance of tl^ pernneru U.S. patent or upon laying open to the public of any 
U.S. or foreign patent applicabon. whichever comes first, and assures availabUity of the progeny lo one determined 
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by the U.S. Coraniissione: of Patents and Trademarks to be entiUed ihereco according lo 35 USC § 122 and tfcc 
Conmissioncr's rules pursuam thereto (including 37 CFR § 1.14 wiih parucuiar reference to 886 OG 638J. 

Tne assignes of the presem application has agreed that if a culmre of the nu'xrials on deposit should die or 
be lost or dc5Uo>'tQ when cultivaTed under suitable condiiions, ttt maierials will be pion^itly replaced on TKHificauon 
u'iih another of ihe same. Availabiliry of the deposited material is not to be construed as a license to practice the 
5 invenuon in comravention of the righu granied luidcr the authority of any govermnetil in accordance with in patent 
laws. 

The foregoing wrinen spccificaiion is considered to be sufficient to enable one skilled in the an to practice 
iht invcniion. The presem invenuon is not ;o be limited in scope by the consmin deposited, since the deposited 
embodiment is inicnded as a single illusiradon of certain aspects of the invcnoon and any constructs that are 

10 iunctioi:ally equivalent are within the scope of this invention. The deposit of maicrial herein docs not constimte an 
admission that the written description herein contained is inadequate to enable the pracnce of any aspect of the 
invention, including tibe best mode thereof, nor is it to be construed as limitijig the scope of the claims to the specific 
iUustradons tha: it represents. Indeed, various modifications of the invention in addition to those shown aad described 
herein will become apparent to those skilled in the art from fee foregoing description and fall within the scope of the 

15 appended claims. 
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W-KATTS CT_AT\fFr>1^ 

1. Uolaicd nucleic acid havicg a: lea^i 80% sequcnc- id£nr:r)' to a nucleotide scqumce that encodes 
a poijrpeptide compming an arajuo acid sequence selected from the group consisuEg cf the amino acid sequence 
sho'A-n in Figure 2 (SEQ ID N0:2). Figure 4 (SEQ ID N0:4). Figure 6 (SEQ ID N0:l2;. Figure 9 (SEQ ID NO; 18). 
Figure 11 (SEQ ID NO:23), Figure 13 (SEQ ID KO:28). Figure 15 (SEQ ID NO:34), Figure 17 (SEQ ID NO:39). 
5 Figure 19 [SEQ ID NO:49). Figure 22 {SEQ ID KO:59}, Figure 24 (SEQ ID NO:64), Figure 26 (SEQ ID NO:69), 
Figure 28 (SEQ ID NO;71). Figure 30 (SEQ ID KO:73). Figure 32 {SEQ ID NO;84), Figure 34 (SEQ ID N0:91). 
Figure 36 (SEQ ID NO:96), Figure 38 (SEQ ID N0;1D4). Figure 40 (SEQ ID N0:109), Figure 42 (SEQ ID 
NO-.IU), Figure 44 (SEQ ID K0-.U9). Figure 46 {SEQ ID NO:127), Figure 48 (SEQ ID NO:132), Figure 50 (SEQ 
ID NO-.BT). Figure 52 (SEQ ID NO: 142). Figure 54 (SEQ ID NO:148), Figure 56 (SEQ ID NO; 153). Figure 58 

10 (SEQ ID NQ:159). Figure 60 (SEQ ID NO-J64), Figure 62 (SEQ ID NO'.nO), Figure 64 (SEQ ID NO:175). Figure 
66 (SEQ ID NO: 177), Figure 68 (SEQ ID NO:185). Figure 70 (SEQ ID NO;190). Figure 72 (SEQ ID NO:195). 
Figure 74 (SEQ ID NO:201), Figure 76 (SEQ ID miQl), Figure 78 (SEQ ID NO:213), Figure 80 (SEQ ID 
KO:22I). Figure 82 (SEQ ID NO:227). Figure 84 (SEQ ID NO:236). Figure 86 (SEQ ID NO:245). Figure 88 (SEQ 
ID NO:250}. Fig^jrc 90 (SEQ ID NOi255). Figure 92 (SEQ ID NO;257), Figure 94 (SEQ ID NO:259), Figure 96 

15 (SEQ ID NO;26i). Figure 98 (SEQ ID NO:263), Figure 100 (SEQ ID NO:285), Figure 102 (SEQ ID NO:290), 
Figure 104 (SEQ ID NO;292), Figure 106 (SEQ ID NO;294), Figure 108 (SEQ ID NO:310), Figure 110 (SEQ ID 
NO:315). Figure 112 (SEQ ID NO:320). Figure 114 (SEQ ID NO:325), Figure 116 (SEQ ID NO:332). Figure 118 
(SEQ ID NO:339), Figure 120 (SEQ ID NO:341) and Figure 122 (SEQ ID NO:377). 

20 2. The nucleic acid of Ciaim 1 . wherein said nucleotide sequence comprises a nucleotide sequence 

selected from the group consisting of the sequence shown in Figure 1 (SEQ ID N0:1), Figure 3 (SEQ ID N0:3}. 
Figure 5 (SEQ ID NO: 1 1). Figure 8 (SEQ ID NO; 17). Figure 10 (SEQ ID NO:22), Figure 12 (SEQ ID NO:27), 
Figure 14 (SEQ ID NO:33), Figure 16 (SEQ ID NO:38), Figure 18 (SEQ ID NO;48), Figure 21 (SEQ ID N0:5S). 
Figure 23 {SEQ ID NO:63), Figure 25 (SEQ ID NO:68), Figure 27 (SEQ ID NO;70), Figure 29 (SEQ ID NO:72), 

25 Figure 31 (SEQ ID NO;83). Figure 33 (SEQ ID NO;90), Figure 35 (SEQ ID NOi95). Figure 37 (SEQ ID NO: 103). 
Figure 39 (SEQ ID NO:108), Figure 41 (SEQ ID N0;113), Figure 43 (SEQ ID N0:118), Figure 45 (SEQ ID 
ND:126). Figure 47 (SEQ ID N0:I31). Figure 49 (SEQ ID NO: 136), Figurr 51 (SEQ ID N0:141), Figure 53 (SEQ 
ID NO:147). Figure 55 (SEQ ID NO:152), Figure 57 (SEQ ID NO:158). Figure 59 (SEQ ID NO:163). Figure 61 
(SEQ ID NO:169), Figure 63 (SEQ ID N0:n4), Figure 65 (SEQ ID NO:176). Figure 67 (SEQ ID NO:184), Figure 

30 69 (SEQ ID NO; 189). Figure 71 (SEQ ID NO:194), Figure 73 (SEQ ID NO:200), Figure 75 (SEQ ID NO:206). 
Figure 77 (SEQ ID N0.212), Figure 79 (SEQ ID NO:220). Figure 81 (SEQ ID NO:226). Figure 83 (SEQ ID 
NO:235), Figure 85 (SEQ ID NO:244), Figure 87 (SEQ ID NO:249). Figure 89 (SEQ ID NO:254). Figure 91 (SEQ 
ID NO:256), Figure 93 (SEQ ID NO:258). Figure 95 (SEQ ID NO:260). Figure 97 (SEQ ID NO:262). Figure 99 
CSEQ ID NO:284), Figure 101 (SEQ ID N0289), Figure 103 (SEQ ID NOi291), Figures 105A-B (SEQ ID NO:293). 

35 Figure 107 (SEQ ID NO:309), Figure 109 (SEQ ID NO:314), Figure lU (SEQ ID NO:319). Figure U3 (SEQ ID 
NO:324), Figure 115 (SEQ ID NO:33:), Figure 117 (SEQ ID NO:338), Figure 119 (SEQ ID NO.340) and Figure 
121 (SEQ ID NO:376). or the complcmenr thereof. 
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3. The nucleic acid of CLaun 1, wherein said nucleotide sequence cotnprises a nucleotide sequence 
selected from the group consisting of th.c full-lengLh coding sequcncr of the sequence shown in Figure 1 (SEQ ID 
N0:1). Figure 3 (SEQ ID N0:3), Figuir 5 (SEQ ID N0:11). Figure 8 (SEQ ID NO: 17), Figure 10 (SEQ ID NO;22). 
Fig'jic 12 (SEQ ID NO;27), Figure 14 (SEQ ID NO:33), Figure 16 (SEQ ID NO:38). Figure 18 (SEQ ID NO:48), 
Figure 21 (SEQ ID NO-.58). Figure 23 (SEQ ID NO:63). Fig-Jie 25 (SEQ ID NO:68), Figure 27 (SEQ ID NO.70), 

5 Figure 29 (SEQ ID NO:72). Fipjrc 31 (SEQ ID NO:83). Fig-Jre 33 (SEQ ID N0;90). Figure 35 (SEQ ID NO:95). 
Figure 37 (SEQ ID NO: 103). Figure 39 (SEQ ID NO: 108). Figure 41 (SEQ ID NO: 113). Figure 43 (SEQ ID 
NOillS). Figure 45 (SEQ ID NO:126), Figure 47 (SEQ ID N0:131), Figure 49 (SEQ ID NO:i36). Figure 51 (SEQ 
ID NO: 141), Figure 53 (SEQ ID NO: 147), Figure 55 (SEQ ID NO:152). Figure 57 (SEQ ID NO:I58). Figure 59 
(SEQ ID NO: 163), Figure 61 (SEQ ID NO: 169). Figure 63 (SEQ ID NO; 174), Figure 65 (SEQ ID NO: 176). Figure 

10 67 (SEQ ID NO: 184). Figure 69 (SEQ ID NO:189). Figure 71 (SEQ ID NO:194). Figure 73 (SEQ ID N0:200), 
Figure 75 (SEQ ID N0;2Q6). Figure 77 (SEQ ID NO:212). Figure 79 (SEQ ID NO.220). Figure 81 (SEQ ID 
NO:226). Figure 83 (SEQ ID N0:235), Figure 85 (SEQ ID N0:244). Figure 87 (SEQ ID N0:249). Figure 89 (SEQ 
ID NO;254), Figure 91 (SEQ ID NO;256), Figure 93 (SEQ ID NO:258), Figure 95 (SEQ ID NO:260), Figure 97 
(SEQ ID NO;262). Figure 99 (SEQ ID NO:284). Figure 101 (SEQ ID NO:289), Figure 103 (SEQ ID N0:291). 

15 Figuxes 105A-B (SEQ ID NO:293), Figure 107 (SEQ ID NO:309). Figure 109 (SEQ ID NO:314). Figure 111 (SEQ 
ro NO:319). Figure 113 (SEQ ID NO:324), Figure 115 (SEQ ID NO:331), Figure 117 (SEQ ID'N0:338), Figure 
1 19 (SEQ ID NO:340) and Figure 121 (SEQ ID NO:376), or die complement thereof. 

4 . Isolated nucleic acid which comprises the fulWcngth coding sequence of the DNA deposited under 
20 accession number ATCC 209258. ATCC 209256, ATCC 209264. ATCC 209250. ATCC 209375. ATCC 209378, 

ATCC 209384. ATCC 209396, ATCC 209420, ATCC 209480. ATCC 209265. ATCC 209257. ATCC 209262. 
ATCC 209253. ATCC 209402. ATCC 209401. ATCC 209397. ATCC 209400, ATCC 209385. ATCC 209367, 
ATCC 209432, ATCC 209263, ATCC 209251. ATCC 209255. ATCC 209252. ATCC 209373. ATCC 209370. 
ATCC 209523. ATCC 209372, ATCC 209374. ATCC 209373. ATCC 209382, ATCC 209383. ATCC 209403. 
25 ATCC 209398, ATCC 209399. ATCC 209392. ATCC 209387. ATCC 209388, ATCC 209394. ATCC 209421, 
ATCC 209393. ATCC 209418. ATCC 209485. ATCC 209483. ATCC 209482. ATCC 209491. ATCC 209481, 
ATCC 209438, ATCC 209927, ATCC 209439, ATCC 209489. ATCC 209433. ATCC 209488. ATCC 209434. 
ATCC 209395, ATCC 209486. ATCC 209490. ATCC 209484 or ATCC 20937U 

30 5 A vector comprising the nucleic acid of Claim 1. 

6. The vector of Claim 5 operably linked to control sequences recognized by a host cell transformed 

with the veaor. 

35 7 . A host cell comprising the vector of Claim 5 . 

8. The host cell of Claim 7 wherein said cell is a CHO cell. 
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5, The host ccD of Claim 7 utsrein said cell is an £. coU. 
10. Tnc host cc]l of Claim 7 wherein said eel] is a yeast cell, 

11: A process for producing a PRO polypeptides comprisij^g culmring the host cell of Claim 7 under 
condinons simable for expression of said PRO poiypepiice and recovering said PRO polypeptide from the cell ailture. 

12. Isolated native sequence PRO polypeptide having ai least 80% sequence idcntiiy to an amino acid 
sequence selected from the group consisting of the, amino acid sequence shown in Figure 2 (SEQ ID N0:2), Figure 
4 CSEQ ID N0:4). Figure 6 (SEQ ID K0:12}. Figure 9 (SEQ ID N0:18), Figure 11 (SEQ ID NO:23). Figure 13 
(SEQ ID NO;28). Figure 15 CSEQ ID NO:34), Figure 17 {SEQ ID NO:39J, Figure 19 (SEQ ID NO:49), Figure 22 
(SEQ ID NO:59). Figure 24 (SEQ ID NO:64), Figure 26 (SEQ ID NO:69). Figure 28 (SEQ ID N0:71). Figure 30 
(SEQ ID NO:73), Figure 32 (SEQ ID NO:84), Figure 34 (SEQ ID N0:91), Figure 36 (SEQ ID NO:96), Figure 38 
(SEQ m NO:104). Figure 40 (SEQ ID N0:109). Figure 42 (SEQ ID N0:1 14). Figure 44 (SEQ ID N0;119). Figure 
46 CSEQ ID NO:I27). Figure 48 (SEQ ID NOa32). Figure 50 (SEQ ID NO:137). Figure 52 (SEQ ID NO:142). 
Figure 54 (SEQ ID NO: 148). Figure 56 (SEQ ID NO: 153). Figure 58 (SEQ ID NO:I59). Figure 60 (SEQ ID 
NO:164). Figure 62 (SEQ ID NO:170). Figure 64 (SEQ ID NO:175). Figure 66 (SEQ ID NO:177); Figure 68 (SEQ 
ID NO:185). Figure 70 (SEQ ID N0:190). Figure 72 (SEQ ID NO:195). Figure 74 (SEQ ID NO:201). Figure 76 
(SEQ ID NO:207). Figure 78 (SEQ ID KO:213). Figure 80 (SEQ ID NO:221). Figure 82 (SEQ ID NO:227), Figure 
84 (SEQ ID NO:236), Figure 86 (SEQ ID KO:245). Figure 88 (SEQ ID NO:250), Figure 90 (SEQ ID NO:255). 
Figure 92 (SEQ ID NO:257). Figure 94 (SEQ ID NO:259). Figure 96 (SEQ ID NO:261). Figure 98 (SEQ ID 
NO:263). Figure 100 (SEQ ID NO:285), Figure 102 (SEQ ID NO:290). Figure 104 (SEQ ID NO:292). Figure 106 
(SEQ ID NO:294}, Figure 108 (SEQ ID NO:310). Figure 110 (SEQ ID NO:3I5), Figure 112 (SEQ ID NO:320). 
Hgurr 114 (SEQ ID NO:325). Figure 116 (SEQ ID NO:332), Figure 118 (SEQ ID NO:339). Figure 120 (SEQ E) 
NO:341) and Figure 122 (SEQ ID NO:377). 

13. Isolated PRO polypeptide having at least 80% sequence idcnhty to tbc amino add sequence encoded 
b>' tiic nucleotide deposited under accession number ATCC 209258. ATCC 209256, ATCC 209264. ATCC 209250. 
ATCC 209375. ATCC 209378. ATCC 209384. ATCC 209396. ATCC 209420. ATCC 209480. ATCC 209265.' 
ATCC 209257. ATCC 209262. ATCC 209253. ATCC 209402. ATCC 209401. ATCC 209397. ATCC 209400. 
ATCC 209385. ATCC 209367. ATCC 209432. ATCC 209263, ATCC 209251. ATCC 209255. ATCC 209252, 
ATCC 209373. ATCC 209370. ATCC 209523. ATCC 209372. ATCC 209374. ATCC 209373. ATCC 209382^ 
ATCC 209383. ATCC 209403, ATCC 209398. ATCC 209399. ATCC 209392. ATCC 2093B7. ATCC 209388. 
ATCC 209394. ATCC 209421. ATCC 209393. ATCC 209418, ATCC 209485. ATCC 209483. ATCC 209482. 
ATCC 209491. ATCC 209481, ATCC 209438. ATCC 209927. ATCC 209439. ATCC 209489. ATCC 209433. 
ATCC 209488. ATCC 209434. ATCC 209395. ATCC 209486. ATCC 209490. ATCC 209484 or ATCC 209371. 
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14. A cfcminc moicaiic compming a polypcpdds accordxg id Claim 12 fused lo a heterologous amino 
acid scqurr.ce. 

15. Tne chimeric molecule cf Claiin K wherein said heterologous amino acid sequence is an epitope 
tag sequence. 

5 

16. The chimeric molecule of Claim 14 wherein said heterologous amino add sequtnce is a Fc region 
of an immunogiobulin. 

17. An aniibody which specifically binds to a PRO polypeptide according lo Claim 12. 

10 . 

18. The anribody of Claim 17 wherein said antibody is a monoclonal antibody. 

19. Isolated nucleic acid having at least 80% sequence idemity to a nucleotide sequcocc encoding a 
PR0228 polypeptide having amino acid residues 1 to 690 of Figure 19 (SEQ ID NO:49). 

It 

20. The nucleic acid of Claim 19, wherein said nucleotide sequence comprises the nucleotide sequence 
of Figure 18 (SEQ ID NO: 48), or its complement. 

2 1 The nucleic acid of Claim 19, wherein said nuclcoddc sequence comprises nucleoddes 24-2093 of 
20 Fignre IS (SEQ ID NO:48), or its complement. 

22, An isolated nudeic acid comprising the nucleotide sequence of the fiiU-lcngth coding sequence of 
clone UNQ202 (DNA33092-12O2) deposited under accession number ATCC 209420. 

2^ 23 An isolated nucleic acid encoding an Mtracellular domain of a PR0228 polypepnde. 

24 A vector comprising the nucleic acid of any one of Claim 1 9 to 23. 

25 . The vector of Claim 24 opcrably United to control sequences recognized by a host cell transformed 
30 with die vector. 

26. A host cell comprising the vector of Claim 24. 

27. The host cell of Claim 25 wherein said cell is a CHO cell. 

35 

28. The host cell of Claim 25 wherein said cell is an £. coU. 
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29 Tr^ hos: ctU of CU'jn 25 whtTein said ctU is a yeast ctll 

30. A process for producing a PR0228 polypeptid: comprising cjlnmng ±c hosr cdl of Claim 25 
under cordioons suitable for expression of said PR0228 polypeptide and recovering said PR0228 pol>'pcpddc frorr. 
the cell c'jlrure. 

5 

3 1 Isoiaicd native scqucr.ce PR0228 polypeptide comprising amino acid rcsiduw 1 to 690 of Figure 

19 (SEQTD NO:49). 

32. An isolated extracellular domain of a PR0228 polypeptide. 

10 

33. A chimeric molecule comprising a PR0228 polypeptide fused to a heterologous amino acid 

sequence. 

34. The chimeric molecule of Claim 33 wherein said hcterologotis amino acid sequence is an epitope 

15 ...g se^ucice 

35. Tnc chimeric molecule of Claim 33 wherein said heterologous amino acid sequence is a Fc region 

of immunoglobulin. 

20 36, An antibody which specifically binds to a PR0228 polypeptide. 

37 . The antibody of Claim 36 wherein said antibody is a monoclonal antibody. 

3S . A mettLod of inducing apopiosis of tumor ccUs, said method coc^rising: 
25 contacimg said tumor cells with an apoptosis -inducing amount of a PR0228 polypeptide, wherein apoptosis 

of said tumor cells is induced. 

39. Tnc method according to Claim 39, wherein said contacting in in vivo. 
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nGURE 1 

GGCC3C-.-.GCAGCACGGCCGCAGGACCTGGAGCTCCGGCTGCGTCTTCCCGCAGCGCTACCCG 
CCATGCGCCTGCCGCGCCGGGCCGCGCTGGGGCTCCTGCCGCTTCTGCTGCTGCTGCCGCCC 
GCGCCGGAGGCCGCCAAGAAGCCGACGCCCTGCCACCGGTGCCGGGGGCTGGTGGACAAGTT 
TiiJ^CCAGGGGATGGTGGACACCGGAAAaV^.GAACTTTGGCGGCGGGAACACGGCTTGGGAGG 
;iJlA.ilGACGCTGTCCAAGTAGGAGTCCAGCGAGATTCGCCTGCTGGAGATCCTGGAGGGGCTG 

tgcgagagcagcgacttcgaatgcaatcagatgctagaggcgcagg;^.ggagcacctggaggc 

CTGGTGGCTGCAGCTGAAGAGCGAATATCCTGACTTATTCGAGTGGTTTTGTGTGAAGACAC 

TGA-I^J^.GTGTGCTGCTCTCCAGGAACCTACGGTCCCGACTGTCTCGCATGCCAGGGCGGATCC 

CAGAGGCCCTGCAGCGGGAATGGCCACTGCAGCGGAGATGGGAGCAC-ACAGGGCGACGGGTC 

CTGCCGGTGCCACATGGGGrACCAGGGCCCGCTGTGCACTGACTGCATGGACGGCTACTTCA 

GCTCGCTCCGGAACGAGACCCACAGCATCTGCACAGCCTGTGACGAGTCCTGCAAGACGTGC 

TCG-GCCTGACCAACAGAGACTGCGGCGAGTGTGAAGTGGGCTGGGTGCTGGACGAGGGCGC 

CTGTGTGGATGTGGACGAGTGTGCGGCCGAGCCGCCTCCCTGCAGCGCTGCGCAGT 

TCTGTA?.GAACGCCAACGGCTCCTACACGTGCGAAGAGTGTGACTCCAGCTGTGTGGGCTGC 

ACAGGGGAAGGCCCAGGAAACTGTAAAGAGTGTATCTCTGGCTACGCGAGGGAGCACGGACA 

GTGTGCAGATGTGGACGAGTGCTCACTAGCAGAAAAAACCTGTGTGAGGAAAAACGAAAACT 

GCTAC;^J^TACTCCAGGGAGCTACGTCTGTGTGTGTCCTGACGGCTTCGAAGAAACGGAAGAT 

GCCTGTGTGCCGCCGGCAGAGGCTGAAGCCAOVGAAGGAGAAAGCCCGACACAGCTGCCCTC 

CCGCGi^J^GACCTGTAATGTGCCGGACTTACCCTTTAAATTATTCAGAAGGATGTCCCGTGGA 

AAJ^.TGTGGCCCTGAGGATGCCGTCTCCTGCAGTGaACAGCGGCGGGGAGAGGCTGCCTGCTC 

TCTAACGGTTGATTCTCATTTGTCCCTTA.AACAGCTGCATTTCTTGGTTGTTCTTAAACAGA 

CTTGTATATTTTGA7ACAGTTCTTTGTAATAAAATTGACCATTGTAGGTAATCAGGAGGAAA 
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CCAGGCCGGGAGGCGACGCGCCCAGCCGTCTAAACGGGAACAGCCCTGGCTGAGGGAG2TGC 
AGCGCAGCAGAGTATCTGACGGCGCCAGGTTGCGTAGGTGCGGCACGAGGAGTTTTCCCGGC 
AGCGAGGAGGTCCTGA.GCAGCATGGCCCGGAGGAGCGCCTTCCCTGCCGCCGCGCTCTGGCT 
CTGGAGCATCCTCCTGTGCCTGCTGGCACTGCGGGCGGAGGCCGGGCCGCCGCAGGAGGAGA 
GC C T G TAX C TATG GAT C GATG C TCACCAGGCAAGAGTACTCATAGGAT TTGAJ\GAJiGATATC 
C T GAT T G T T T C AG AGG G GAAAJiT GG C ACC T T T T ACAC AT GAT TT C AG AAJIAG C G CAACAGAG 
AJ^TGCCAGCTATTCCTGTCAATATCaATTCCATGAATTTTACCTGGCAAGCTGCAGGGCAGG 
CAGAJ^.TACTTCTATGAA.TTCCTGTCCTTGCGCrCCCTGGATAAAGGCATCATGGCAGATCCA 
ACCGTCAA.TGTCCCTCTGCTGGGAACAGTGCCTCACAAGGCATCAGTTGTTCAAGTTGGTTT 
CCGATGTCTTGGAAA-ACAGGATGGGGTGGCAGCATTTGAAGTGGATGTGATTGTTATGAATT 
CTGAAGGCAACAXCATTCTCCAAACACCTCAAAATGCTATCTTCTTTAAAACATGTCAACAA 
GCTGAGTGCCCAGGCGGGTGCCGAAATGGAGGCTTTTGTAATGAJIAGACGCATCTGCGAGTG 
TCCTGATGGGTTCCACGGACCTCACTGTGAGAAAGCCCTTTGTACCCCACGATGTATaAATG 
GTGGACTTTGTGTGACTCCTGGTTTCTGCATCTGCCCACCTGGATTCTATGGAGTGAACTGT 
GA.CAA-AGCAAACTGCTCAACCACCTGCTTTAATGGAGGGACCTGTTTCTACCCTGGAAAATG 
TATTTGCCCTCCAGGACTAGAGGGAGAGCAGTGTGAAATCAGCAJ^TGCCCACAACCCTGTC 
GAJ^TGGAGGTAAJi-TGCATTGGTAAAAGCAAATGTAAGTGTTCCAJ^GGTTACCAGGGAGAC 
CTCTGTTCAAAGCCTGTCTGCGAGCCTGGCTGTGGTGCACATGGAACCTGCCATGAACCCAA 
Ci^-AAT G C C A.AT G T CJ^-AGAAG GT T GGC ATGGAAGACACTGCAAT AAAAGGTACGAAGCCAGCC 
TCATACATGCCCTGAGGCCAGCAGGCGCCCAGCTCAGGCAGCACACGCCTTCACTTAAAAAG 
G C C G A.G G AG C G G C G G GA T C C AC C T GA_ATCCAAT TACAT C T G G TGAAC T C CGACAT CTGAAAC 
GTTTTAA.GTTACACCAAGTTCATAGCCTTTGTTAACCTTTCATGTGTTGAATGTTCAAATAA 
T3TTCATTACACTTAAGAATACTGGCCTGAATTTTATTAGCTTCATTATAAATCACTGAGCT 
GATATTTACTCTTCCTTTTAJIGTTTTCTAAGTACGTCTGTAGCATGATGGTATAGATTTTCT 
TGTTTCAGTGCTTTGGGACAGATTTTATATTATGTCAATTGATCAGGTTAAAATTTTCAGTG 
TGTAGTTGGCAGATATTTTCAAJ^TTACAATGCJITTTATGGTGTCTGGGGGCAGGGGAACAT 
GAGPJ^AGGTTAJIATTGGGCAAAJ^ATGCGTAAGTCACAAGAATTTGGATGGTGCAGTTAATGT 
TGAA-GTTACAGCATTTCAGATTTTATTGTCAGATATTTAGATGTTTGTTACATTTTTAAAAA 
TTGCTCTTAATTTTTAAACTCTCA-ATACAATATATTTTGACCTTACCATTATTCCAGAGATT 
CAGTATTAAAAJiJiJlAAAAAJVTTACACTGTGGTAGTGGCATTTAAACAATATAATATATTCTA 
AA.CAC.^-ATGAAATAGGGAATATAATGTATGAACTTTTTGCATTGGCTTGAAGCAJ^TATAATA 
TATTGTAAACAAJU^CACAGCTCTTACCTAATAAACATTTTATACTGTTTGTATGTATAAAAT 
A-AAGGTGGTGCTTTAGTTTTTTGGAAJIuAAJWVAAAAAJVAAJ^JiJU^^^ 
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FIGURE 5 

CGGACGCGTGGGCGTCCGGCGG7CGCAGAGCCAC-GAG-3CGGAGGCGCGCGGGCCAGCCTGGG 

CCCCAGCCCACACCTTCACCAGGGCCCAGGAGCCACCATGTGGCGATGTCCACTGGGGCTAC 

TGCTGTTGCTGCCGCTGGCTGGCCACTTGGCTCTGGGTGCCCAGGAGGGTCGTGGGCGCCGG 

QAGCTAGCACCGGGTCTGCACCTGCGGGGCATCCGGGACGCGGGAGGCCGGTACTGCCAGGA 

GCAGGACCTGTGCTGCCGCGGCCGTGCCGACGACTGTGCCCTGCCCTACCTGGGCGCCATCT 

GTTACTGTGACCTCTTCTGCAACCGCACGGTCTCCGACTGCTGCCCTGACTTCTGGGACTTC 

TG C CTCGG CG TG CCAC C C C CTTTTCCCCCGATCCAAGGATGTATGCATGGAGGTCGTATCTA 

TCCAGTCTTGGGAACGTACTGGGACAACTGTAACCGTTGCACCTGCCAGGAGAACAGGCAGT 

GGCAGTGTGACC^J^GAACCATGCCTGGTGGATCCAGACATGATCAAAGCCATCAACCAGGGC 

AACTATGGCTGGCAGGCTGGGAACCACAGCGCCTTCTGGGGCATGACCCTGGATGAGGGCAT 

TCGCTACCGCCTGGGCACCATCCGCCCATCTTCCTCGGTCATaAACATGCATGAAATTTATA 

CAGTGCTGAACCCAGGGGAGGTGCTTCCCACAGCCTTCGAGGCCTCTGAGAAGTGGCCCAAC 

CTGATTGATGAGCCTCTTGACCAAGGCAACTGTGCAGGCTCCTGGGCCTTCTCCACAGCAGC 

TGTGGCATCCGATCGTGTCTCAATCCATTCTCTGGGACACATGACGCCTGTCCTGTCGCCCC 

AGAACCTGCTGTCTTGTGACACCCACCAGCAGCAGGGCTGCCGCGGTGGGCGTCTCGATGGT 

GCCTGGTGGTTCCTGCGTCGCCGAGGGCTGGTGTCTGACCACTGCTACCCCTTCTCGGGCCG 

TGAACGAGACGAGGCTGGCCCTGCGCCCCCCTGTATGATGCACAGCCGAGCCATGGGTCGGG 

GCAAGCGCCAGGCCACTGCCCACTGCCCCAACAGCTATGTTAATAACAATGACATCTACCAGGTC 

ACTCCTGTCTACCGCCTCGGCTCCAACGACAAGGAGATCATGAAGGAGCTGATGGAGAATGG 

CCCTGTCCAAGCCCTCATGGAGGTGCATGAGGACTTCTTCCTATACAAGGGAGGCATCTACA 

ncCACACGCCAGTGAGCCTTGGGAGGCCAGAGAGATACCGCCGGCATGGGACCCACTCAGTC 

AAGATCACAGGATGGGGAGAGGAGACGCTGCCAGATGGAAGGACGCTCAAATACTGGACTGC 

GGCCAACTCCTGGGGCCCAGCCTGGGGCGAGAGGGGCCACTTCCGCATCGTGCGCGGCGTCA 

ATGAGTGCGACATCGAGAGCTTCGTGCTGGGCGTCTGGGGCCGCGTGGCCATGGAGGACATG 

GGTCATCACTGAGGCTGCGGGCACCACGCGGGGTCCGGCCTGGGATCCAGGCTAAGGGCCGG 

CGGAAGAGGGCCCAATGGGGCGGTGACCCCAGCCTCGCCCGACAGAGCCCGGGGCGCAGGCG 

GGCGCCAGGGCGCTAATCCCGGCGCGGGTTCCGCTGACGCAGCGCCCCGCCTGGGAGCCGCG 

GGCAGGCGAGACTGGCGGAGCCCCCAGACCTCCCAGTGGGGACGGGGCAGGGCCTGGCCTGG 

GAAGAGCACAGCTGCAGATCCCAGGCCTCTGGCGCCCCCACTCAAGACTACCAAAGCCAGGA 

CACCTCAAGTCTCCAGCCCCAATACCCCACCCCAATCCCGTATTCTTU'irri T ri'in'rrr r AGA 

AGGGTCTTGCTCCG7TGCCCAGGTTGGAGTGCAGTGGCCCATCAGGGCTCACTGTAACCTCC 

GACTCCTGGGTTCAAGTGACCCTCCCACCTCAGCCTCTCAAGTAGCTGGGACTACAGGTGCA 

CCACCACACCTGGCTAATTTTTGTATTTTTTGTAAAGAGGGGGGrCTCACTGTGTTGCCCAG 

GCTGGTTTCGAACTCCTGGGCTCAAGCGGTCCACCTGCCTCCGCCTCCCAAAGTGCTGGGAT 

TG CAGG CATGAG CCACTG CACCCAGCCCTGTATTCTTATTCTTCAGATATTTATTTTTCTTT 

T C ACTGTT7T AA-AATAAAACC AAAGT ATTGATAAAAAAAAA 
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FTGTJRE 6 

:><ss .DNA33223 

xsubunit 1 of 1, 4 £7 aa, i step 
><VW: 523 67, pi: 6.55, N>L[S/T): 2 

^TOC?LGLLLLI;PI^2,GHLJ:J:,GAQQGRGRRELAF3LHLRGIRDAGGHYCQEQDLCCRGRAD 
DCALPYLGAICYCDLFCNRTVSDCCPDFWDFCLGVFPFFPPIQGCMGGRIYPVLGrYWD 
NCNHCTCQENRQWQCDQEPCLVDPDMIKAINQGNYGWQAGNHSAFWGMTLDEGIRYRLGT 
I RP 3 S SV^I^T^IHE I YTVIJiJPGEVLPTAFSAS EKWFNLI HEPLDQGNCAGSWAFSTAAVASD 
RVSIHSLGHMTPVLSPQNLLSCDTHQQQGCRGGRLDGAWWFLRRRGWSDHCYPFSGREH 
DRAG ? AP P CMMK SRAMGRGKRQ AT AHC PNS YVMmD I YQVT F VYRLGSNDKE IMKELMEN 
GPVQAK^VHEDFFLYKGGIYSHTPVSLGRPERYRRHGTHSVKITGVJGEETLPDGRTLKY 
WTAAI^SWGPAWGERGHFRIVRGVNECDIESFVLGWGRVGMEDMGHH 
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FIGUR E 7 

AGGCTCCTTGGCCCTTTTTCCACAGCAAGCTTKTGCKATCCCGATTCGTTGTCTCAAATC 
CA-ATTCTCTTGGGACACATNACGCCTGTCCTTTNGCCCCAGAACCTGCTGTGTTGTACAC 
CCACCAGCAGCAGGGCTGCCGCGNTGGGCGTCTCGATGGTGCCTGGTGGTTCCTGCGTCG 
CCGAGGGrfTGGTGTCTGACCACTGCTACCCCTTCTCGGGCCGTGAACGAGACGAGGCTGG 
CCCTGCGCCCCCCTGTATGATGCACAGCCGAGCCATGGGTCGGGGCAAGCGCCAGGCCAC 
TGCCCACTGCCCCAACAGC7ATGTTAATAACAATGACATCTACCAGGTCACTCCTGTCTA. 
CCGCCTCGGCTCCAACGACAAGGAGATCATGAAGGAGCTGATGGAGAATGGCCCTGTCCA 
AGGCCTCATGGAGGTGCATGAGGACTTCTTCCTATACAAGGGAGGCATCTACAGCCACAC 
GCCAGTGAGCCTTGGGAGGCCAGAGAGATACCGCCGGGATGGGACCCACTCAG 
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FIGURE 8 

GCTGCTTGCCCTGTTGATGGCAGGCTTGGCCCTGCAGCCAGGCACTGCCCTGCTGTGCTACT 
C C i G -.lUUiG CC CAG GTG AG CAACGAGGACTGCCTGCAGGTGGAGAACTGCACCCAGCTGGGG 
GAGCAGTGCTGGACCGCGCGCATCCGCGCAGTTGGCCTCCTGACCGTCATCAGCAAAGGCTG 
CAGCTTGAJICTGCGTGGATGACTCACAGGACTACTACGTGGGCAJIGAJ^GAACATCACGTGCT 
GTGACACCGACTTGTGCAACGCCAGCGGGGCCCATGCCCTGCAGCCGGCTGCCGGGATGCTT 
GCGCTGCTCCCTGCACTCGGCCTGCTGCTCTGGGGACCCGGCCAGCTATAGGCTCTGGGGGG 
CCCCGCTGCAGCCCACACTGGGTGTGGTGCCCCAGGCCTCTGTGCCA-CTCCTCACAGACCTG 
GCCCAGTGGGAGCCTGTCCTGGTTCCTGAGGCACATCCTAACGCAAGTCTGACCATGTATGT 
CTGCACCCCTGTCCCCCACCCTGACCCTCCCATGGCCCTCTCCAGGACTCCCACCCGGCAGA 
TCAGCTCTAGTGACACAGATCCGCCTGCAGATGGCCCCTCC^^CCCTCTCTGCTGCTGTTTCCAT 
GGCCCAGCATTCTCCACCCTTAACCCTGTGCTCAGGCACCTCTTCCCCCAGGAAGCCTTCCC 
TGCCCACCCCATCTATGACTTGAGCCAGGTCTGGTCCGTGGTGTCCCCCGCACCCAGCAGGG 
GACAGGCACTCAGGAGGGCCCAGTAAAGGCTGAGATGAAGTGGACTGAGTAGAAGTGGAGGA 
CAAGAGTCGACGTGAGTTCCTGGGAGTCTCCAGAGA.TGGGGCCTGGAGGCCTGGAGGAAGGG 
GCCAGGCCTCACATTCGTGGGGCTCCCTGAATGGCAGCCTGAGCACAGCGTAGGCCCTTAAT 
AJ^iACACCTGTTGGATAAGCCAAAAAAA 
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FIGURE 9 

r^ACOALQPGTALLCYSCKAQVSNZDCLO'N^NCTQLGEQCWTAF-IRAVGLLTVISKGCSLN 
DDS wH'YY'/GKKN I TCCDTDLCNASGAI-IALQPAAAILALLFALGLLLViGFGQL 
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FI GU RE 10 

ATGGGAGCCGCCCGCCTGCTGCCCP_^.CCTC?.CTCTGTGCTTACAGCTGCTGATTCTCTGCTG 
TGAAACTCAGTACGTGAGGGACCAGGGCGCCATGAGCGACCAGCTGAGCAGGCGGCA.GATCC 
GCGAGTACCAACTCTAGAGCAGGACCAGTGGCAAGCACGTGCAGGTCAGCGGGCGTCGCATC 
TCCGCCACCGCCGAGGACGGCAAC;iJi.GTTTGCCAAGCTCATAGTGGAGACGGACACGTTTGG 
CAGCCGGGTTCGCATCAAAGGGGCTGAGAGTGAGAAGTACATCrGTATGAACA?.GAGGGGCA 
AGCTCA?CGGG;^-AGCCCAGCGGG;i-AGAGCAJ^AGACTGCGTGTTaACGGAGATCGTGCTGGAG 
.AACA.ACTATACGGCCTTCCAGAACGCCCGGCA-CGAGGGCTGGTTCATGGCCTTCACGCGGCA 
GGGGCGGCCCCGCCAGGCTTCCCGCAGGCGCCAGAACCAGCGCGAGGCCCACTTCATCAAGC 
GCCTCTACCAAGGCCAGCTGCCCTTCCCCAACCACGCCGAGAAGCAGAJi.GCAGTTCGAGTTT 
GTGGGCTCCGCCCCCACCCGCCGGACCAAGCGCACACGGCGGCCCCAGCCCCTCACGTAGTC 
TGGGAGGCAGGGGGCAGCAGCCCCTGGGCCGCCTCCCCACCCCTTTCCCTTCTTAATCCAAG 
GAGTGGGCTGGGGTGGCGGGAGGGGAGCCAGA.TCCCCGAGGGAGGACCCTGAGGGCCGCGAA 
CLATCCGAGCCCCCAGCTGGGAAGGGGCAGGCCGGTGCCCCAGGGGCGGCTGGCACAGTGCC 
CCCTTCCGGGACGGGTGGCAGGCCCTGGAGAGGAACTGAGTGTCACCCTGATCTCAGGCCAC 
CAGCCTTTGCCGGCCTCCCAGCCGGGCTCCTGAAGCCCGCTGAAAGGTCAGCGACTGAAGGC 
GTTGCAGACAACCGTCTGGAGGTGGCTGTCCTCAAAATCTGCTTCTCGGATCTCCCTCAGTC 
TGCCCCCAGCCCCCAAJ^CTCCTCCTGGCTAGACTGTAGGAAGGGACTTTTGTTTGTTTGTTT 
G T T T GAG GAJiJU^JVAGAAJi.GGGAGAGAGAGGAAAATAGAGGGTTGTCCACTCCTCACAT TCCA 
CGACCCAGGCCTGCAGCCCACCCCCAAGTCCCAGCCCCGGAATAAA^XCATTTTCCTGC 
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FIGURE 11 
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FIGTJRF 12 

ACrTGCCATCACCTGTTGCCAGTGTGGAAAAATTCTCCCTGTTGAATTTTTTG'CAC;iTGGAG 
GACAGCAGCAAAGAGGGCAACACAGGCTGATAAGACCAGAGACAGCAGGGAGATTAT'^'T'AC 
CATACGCCCTCAGGACGTTCCCTCTAGCTGGAGTTCTGGACTTCAAC.z^GAACCCCA'T^CCAG" 
CATTTTGATTTTGCTGTTTATTTTTTTTTTCTTTTTCTTTTTCCCACCACATTGTATTTO'^ 
TTCCGTACTTCAGAJyiTGGGCCTACAGACCACAAAGTGGCCCAGCCATGGGGCTTTTT^CCT 
GAAGTCTTGGCTTATCATTTCCCTGGGGCTCTACTCACAGGTGTCCAAACTCCTGGCCTGCC 
CTAGTGTGTGCCGCTGCGACAGGAACTTTGTCTACTGTAATGAGCGAAGCTTGACCTCAGTG 
CCTCTTGGGATCCCGGAGGGCGTAACCGTACTCTACCTCCACAACAACCAAATTAATAJ^'T'GC 
TGGATTTCCTGCAGAACTGCACAATGTACAGTCGGTGCACACGGTCTACCTGTATGGCAACC 
AACTGGACGAATTCCCCATGAACCTTCCCAAGAATGTCAGAGITCrCCArrTGCAGGAAAACAAT 

attcagaccatttcacgggctgctcttgcccagctcttgaagcttgaagagctgcacctgg;^ 

TGACAACTCCATATCCACAGTGGGGGTGGAAGACGGGGCCTTCCGGGAGGCTATTAGCC'T'CA 

aattgttgtttttgtctaagaatcacctgagcagtgtgcctgttgggcttcctgtggacttg 

CAAGAGCTGAGAGTGGATGAAAATCGAATTGCTGTCATATCCGACATGGCCTTCCAGAATCT 

cacgagcttggagcgtcttattgtggacgggaacctcctgaccaacaagggtatcgccgagg 

GCACCTTCAGCGATCTCACCAAGCTCAAGGAATTTTCAATTGTACGTAATTCGCTGTCCCAC 

cctcctcccgatctcccaggtacgcatctgatcaggctctatttgcaggacaaccagataaa 

CCACATTCCTTTGACAGCCTTCTCAAATCTGCGTAAGCTGGAACGGCTGGATATATCCAACA 

accaactgcggatgctgactcaaggggtttttgataatctctccaacctgaagcagctcact 

GCTCGGAATAACCCTTGGTTTTGTGACTGCAGTATTAAATGGGTCACAGAATGGCTCAAATA 

tatcccttcatctctcaacgtgcggggtttcatgtgccaaggtcctgaacaagtccggggga 

TGGCCGTCAGGGAATTAAATATGAATCTTTTGTCCTGTCCCACCACGACCCCCGGCCTGCCT 

CTCTTCACCCCAGCCCCAAGTACAGCTTCTCCGACCACTCAGCCTCCCACCCTCTCTATTCC 

AAJ^CCCTAGCAGAAGCTACACGCCTCCAACTCCTACCACATCGAAACTTCCCACGATTCCTG 

ACTGGGATGGCAGAGAAAGAGTGACCCCACCTATTTCTGAACGGATCCAGCTCTCTATCCAT 

TTrGTGAATGATACTTCCATTCT^GTCAGCTGGCTCTCTCTCTTCACCGTGATGGCATACAA 

-^"CACATGGGTGAAAATGGGCCACAGTTTAGTAGGGGGCATCGTTCAGGAGCGCATAGTCA 

'GGTGAGAAGCAACACCTGAGCCTGGTTAACTTAGAGCCCCGATCCACCTATCGGATTTGT 
AGTGCCACTGGATGnTTTTaB.nTTvr^r'^^^/^r.r.m^o* , ^, ^« 1 



T 
CA 



^^r^;^^-S^!^^5^^^C---CTTTCTGCTGGCGGGCTTGATCGGGGGCGCGGTGATATTTGTG 
CTGGTGGTCTTGCTCAGCGTCTTTTGCTGGCATATGCACAAAAAGGGGCGCTACACCTCCCA 
GAAGTGGAAATACAACCGGGGCCGGCGGAAAGATGATTATTGCGAGGCAGGCACCAAGAAGG 
AaAACTCCATCCTGGAGATGACAGAAACCAGTTTTCAGATCGTCTCCTTAAATAACGATCAA 
^T^^TTAAAGGAGATTTCAGACTGCAGCCCATTTACACCCCAAATGGGGGCATTAATTACAC 
„ . ^--CATATCCCCAACAACATGCGATACTGCAACAGCAGCGTGCCAGACCTGGAGCACT 

Sx;?S^Si??^:f^^^^^"-^^<^^CAGATTACATTTGATAAATGTTACACAGATG^^ 
A':::;™;^i^^^^^^'^^'^^'^'^TATACGGTGTACTATATAATGGGATTTAAAAA^ 
i . . . X -TATTTCAAGTTAATTACAAACAGTTTTGTAACTCTTTGCTTTTTAAATCTT 
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FIGURE 13 

X1155/SS .DKA26350 

xB'SiD'J^T.-^z 1 cf €6G aa, 1 step 

xMW: 7^045, pi: 8.05, NX(£/T): 7 

^^GLQrTKW?SHGAFFLKSWLIISLC-LYSQV£KLLACPSVCRCDRNFVYC:NERSLTSV?LG 
I PEGVT^vTLYLHNNQ IKNAG? P AELHNVQSVHTVYLYGNQUDEFpmLPKN^ 
I QT I SHJ^^iAQLLKI^EL^Z.DDNS ISTVGVEDGAFRElAISLiaJuFLS KJT:^^ 
DLQELRVTENllIAVISDMAFQKLTSLERLIVDGNLLTNKGIAEGTFSHLTKLKEFSIVRN 
S LS KP ? PDLPGTHL I RLYLQDNQ INHIPLTAFSNLRKLERLD ISNNQLRMLTQGVFDNLS 
KLKQLTARNNPV.T CDCS I KWVTEWLKY I PS SLNVRGFMCQGPEQVRGMAVH3LNMN-LLSC 
PTTT?GL?LFTPA?£TA£FTTQPFTLSIFNFSRSYTPPTPTTSKLPTI?DWDGRERVTPP 
I£ERIQLSIHFVraT£IQVSVn:.SLFTVMAYKLTWVKMGHSLVGGIVQERIVSGEKQHLSL 
VN:.E ?RS TYRI CLVPLDAF^^^RAVEDT I CSEATTHAS YIJWGSNTASSrtEQTTSHSMGSP 
FI^S AGl I GGAVI F VLWLLS VFCWHMHKKGR YTSQKWKYNRGRRKDDYCE AGTKKDNS I L 
rrvTETSFQIVSLNNDQLLKGDFRLQPIYTPNGGINYTDCHIPNNMRYCNSSVPDLEHCHT 
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FTGTJRF14A 

ACTTGGAGCAAGCGGCGGCGGCGGAGACAGAGGCAGAGGCAGAAGCTGC-C-GCTCCGTCCTCG 
CCTCCCACGAGCGATCCCCGAGGAGAGCCGCGGCCCTCGGCGAGGCGAAGAGGCCGi^CGAGG 
Aii.GACCCGGGTGGCTGCGCCCCTGCCTCGCTTCCCAGGCGCCGGCGGCTGCAGCCTTGCCCC 
TCTTGCTCGCCTTGAAAATGGAAAAGATGCTCGCAGGCTGCTTTCTGCTGATCCTCGGACAG 
ATCGTCCTCCTCCCTGCCGAGGCCAGGGAGCGGTCACGTGGGAGGTCCATCTCTAGGGGCAG 
ACACGCTCGGACCCACCCGCAGACGGCCCTTCTGGAGAGTTCCTGTGAGAACAAGCGGGCAG 
ACCTGGTTTTCATCATTGACAGCTCTCGCAGTGTCAACACCCATGACTATGCAAAGG-CAAG 
GAGTTCATCGTGGACATCTTGCAATTCTTGGACATTGGTCCTGATGTCACCCGAGTGGGCCT 
GCTCCAATATGGCAGCACTGTCAAGAATGAGTTCTCCCTCAAGACCTTCAAGAGGAAGTCCG 
AGGTGGAGCGTGCTGTCAAGAGGATGCGGCATCTGTCCACGGGCACCATGACTGGGCTGGCC 
ATCCAGTATGCCCTGAACATCGCATTCTCAGAAGCAGAGGGGGCCCGGCCCCTGAGGGAGAA 
TGTGCCACGGGTCATAATGATCGTGACAGATGGGAGACCTCAGGACTCCGTGGCCGAGGTGG 
CTGCTAAGGCACGGGACACGGGCATCCTAATCTTTGCCATTGGTGTGGGCCAGGTAGACTTC 
AACACCTTGAAGTCCATTGGGAGTGAGCCCCATGAGGACCATGTCTTCCTTGTGGCCAA1TT 
CAGCCAGATTGAGACGCTGACCTCCGTGTTCCAGAAGAAGTTGTGCACGGCCCACATGTGCA 
GCACCCTGGAGCATAACTGTGCCCACTTCTGCATCAACATCCCTGGCTCATACGTCTGCAGG 
TGCAAACAAGGCTACATTCTCAACTCGGATCAGACGACTTGCAGAATCCAGGATCTGTGTGC 
CATGGAGGACCACAACTGTGAGCAGCTCTGTGTGAATGTGCCGGGCTCCTTCGTCTGCCAGT 
GCTACAGTGGCTACGCCCTGGCTGAGGATGGGAAGAGGTGTGTGGCTGTGGACTACTGTGCC 
TCAGAAAACCACGGATGTGAACATGAGTGTGTAAATGCTGATGGCTCCTACCTTTGCCAGTG 
CCATGAAGGATTTGCTCTTAACCCAGATGAAAAAACGTGCACAAGGATCAACTACTGTGCAC 
TGAACAAACCGGGCTGTGAGCATGAGTGCGTCAACATGGAGGAGAGCTACTACTGCCGCTGC 
CACCGTGGCTACACTCTGGACCCCAATGGCAAAACCTGCAGCCGAGTGGACCACTGTGCACA 
GCAGGACCATGGCTGTGAGCAGCTGTGTCTGAACACGGAGGATTCCTTCGTCTGCCAGTGCT 
CAGAAGGCTTCCTCATCAACGAGGACCTCAAGACCTGCTCCCGGGTGGATTACTGCCTGCTG 
AGTGACCATGGTTGTGAATACTCCTGTGTCAACATGGACAGATCCTTTGCCTGTCAGTGTCC 
TGAGGGACACGTGCTCCGCAGCGATGGGAAGACGTGTGCAAAATTGGACTCTTGTGCTCTGG 
GGGACCACGGTTGTGAACATTCGTGTGTAAGCAGTGAAGATTCGTTTGTGTGCCAGTGCTTT 
GAAGGTTATATACTCCGTGAAGATGGAAAAACCTGCAGAAGGAAAGATGTCTGCCAAGCTAT 
AGACCATGGCTGTGAACACATTTGTGTGAACAGTGACGACTCATACACGTGCGAGTGCTTGG 
AGGGATTCCGGCTCGCTGAGGATGGGAAACGCTGCCGAAGGAAGGATGTCTGCAAATCAACC 

GGGATTTGTTCTAGCTGAGGACGGAAGACGGTGCAAGAAATGCACTGAAGGCCCAATTGACC 

1 - : ' ^^^'^'^^™^^TCCTTGACAATTTCCCCCAAAGCCGCTCGAGTGGGGCTGCT 

? i -A-^^^-GGTCCACACAGAGTTCACTCTGAGAAACTTCAACTCAGCCAAAGACA 
:?^^t:^-?^^^^^-^^^^^^A'^GAAATACATGGGAAAGGaCTCTATGACTGGG 
^-^^^^^*;'^'^'^T^^^^GAAGTrTTACCCAAGGAGAAGGGGCCAGGCCCCTTTCCACAAGGGT 

^;^^?^^^^??JS^*'r^GTATCACTATGTATGCTGTTGGGGTAGGAAAAGCCATTGAGGAG 

^^?I^^^S^^^*^^^^'=^T^'^^G-C:CACAAACAAGCATCTCTTCTATGCCGAAGACTTCAG 
CACAATvjGATGAGATAAGTGAAAAArTraanaa lirir^rTi'rr-'rr^'rr.-n Tvrir-rn^rri, r^,, , 



... ^.w^^^v^^^.^ ^ ^ ^ ^^^^^^^(^i^e-rL-CCAAAAACGGTCCAACAGCCAACAGAATC^ 

^^^^^^^?;H^^^^;^-^-^atccaagacctac^ 

ATATCTGTTTGAAGAAGACAATCTTTTACGGTCTACACAAAAGCTTTCCCATTCAACAAAAC 

cttcaggaagccctttggaagaaaaacacgatcaatgcaaatgtgaa^ 

;^S^f?^;^^^^-^-^^G^GAAGTAAGAAAATTAACACAGCGCT^^ 

AATu-oAAGwCCTGGAAAATCGCCTGAGATACAGATGAAGATTAGAAATCGCGACACATTTGT 
^^^^^r^fl^T^^^^^A^CAATGAACGCAGTGCAGAGCCCCAAA^ 

AATCAATAATGTTGTGAAGTAAAACAATCAGTACTGAGAAACCTGGTTTGCCACAGAACAAA 

gac;^^.gaagtatacactaacttgtataaatttatctaggaaaaaaatccttSS^ 

CAA.TTGCTTCT^^u.TCATCCTGCCTTAGTGTGCAATCTCATTTGACTATACGATAAAGTTT 
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nGURE 14P 

GCACAGTCTTACTTCTGTAGAAOVCTGGCCATAGGAAATGCTGTTTTTTTGTACTGGACTTT 
ACCTTGATATATGTATATGGATG?A?GCATAAAATCATAGGACATATG7ACTTGTGGAACAA 
G T T G GAT T T T TT AT A C AAT ATT AAAATT CACCACTTCAG 
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FIGURE I S 

>cll64/ss .DNA32290 

xsub'jinit 1 of 1; 915 aa , 1 stop 

xMW: 102233, pi: 6.02, NX(S/T): 1 

MEK>lLAGCFLLILGQIVLLPASARERSRGRSIERGRHAKrriPQTALLESSCENKR.AX)LVF 
IIDSSRSVNTHDYAiC^K^FIVDILQFI^IGPDVTRVGLLQYGSTVKNEFSLKTFKRKSEV 
ERAVKR>IRHLSTGTMTGI^IQYA1JN"AFSEAEGARPLRENVPRVIMIVTDGRPQDSVAEV 
AAKARDTGILIFAIGVGQVDFNTLKSIGSEPKEDHVFLVANFSQIETLTSVFQKKLCTAK 
MC3TLEHXCAHFCINIPGSYVCRCKQGYILNSDQTTCRIQDLCA.MEDHNCEQLCVNVPGS 
n/CQ C Y S GYAIJVEDGKRCVAVDYCAS ENHGCEHE CVNADGS YLCQ CHEGFALNPDEKTCT 
RI^TYCAI^KPGCEHECVNMEESYYCRCKRGYTLDPNGKTCSRVDHCAQODHGCEQLCIOT 
EDSF\^CQCSEGFLINEDLKTCSRVDYCXLSDHGCEYSCVNMDRSFACQCPEGHVLRSDGK 
TCAKLDSCALGDHGCEKSCVSSEDSFVCQCFEGYILREDGKTCRRKDVCQAIDHGCEHIC 
WSDDS YTCECLEGFRLAEDGKRCRRKDVCKSTHHGCEHI CVNKGNSYI CKCSEGFVIiAE 
DGRRCKKCTEGPIDLVFVIDGSKSLGEENFSWKQFVTGIIDSLTISPKAARVGLLQYST 
QVHTEFTLRNFNSAIODMKKAVAHMKYMGKGSMTGLALKHMFERSFTQGEGARPLSTRVPR 
AAIVFTDGRAQDBVSEWASKAKANGITMYAVGVGKAIEEELOEIASEPTNKHLFYASDFS 
TMDE I S S KLKKGI CEALEDSDGRQDS PAGSLPKTVQQPTESEPVTINIQDLLSCSNFAVQ 
HRYLFEEDNLLRSTQKLSHSTKPSGSPLEEKHDQCKCENLIMFQNLANEEVRKLTQRLEE 
MTQR^TEALENRL■RYR 
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FIGURE 16 

GG;.X-CCGCCC7GG3TGTCAGCGGC7CGGCTCCC:GCGCACGCTCCGGC:CGTCGCGCAGCCTCG 

GCACCTGCAGGTCCGTGCGTCCCGCGGCTGGCGGCCCTaA.CTCCGTCCCGGCCAGGGAGGGC 

CATGATTTCCCTCCCGGGGCCCCTGGTGACCAACTTGGTGCGGTTTTTGTTCCTGGGGCTGA 

GTGCCCTCGCGCCCCCCTCGCGGGCCCAGCTGCAACTGCACTTGCCCGCCAACCGGTTGCAG 

GCGGTGGAGGGAGGGGAAGTGGTGCTTCCAGCGTGGTACACCTTGCACGGGGAGGTGTCTTC 

ATCCCAGCCATGGGAGGTGCCCTTTGTGATGTGGTTCTTCAAACAGAAAGAAAAGGAGGATC 

AGGTGTTGTCCTACATCAATGGGGTCACAACAAGCAAACCTGGAGTATCCTTGGTCTACTCC 

ATGCCCTCCCGGAACCTGTCCCTGCGGCTGGAGGGTCTCCAGGAGAAAGACTCTGGCCCCTA 

CAGCTGGTCCGTGAATGTGCAAGACAAACAAGGCAAATCTAGGGGCCACAGCATCAAAACCT 

TAGAACTCAATGTACTGGTTCCTCCAGCTCCTCCATCCTGCCGTCTCCAGGGTGTGCCCCAT 

GTGGGGGCAAACGTGACCCTGAGCTGCCAGTCTCCAAGGAGTAAGCCCGCTGTCCAATACCA 

GTGGGATCGGCAGCTTCCATCCTTCCAGACTTTCTTTGCACCAGCATTAGATGTCATCCGTG 

GGTCTTTAAGCCTCACCAACCTTTCGTCTTCCATGGCTGGAGTCTATGTCTGCAAGGCCCAC 

A7.TGAGGTGGGCACTGCCCAATGTAATGTGACGCTGGAAGTGAGCACAGGGCCTGGAGCTGC 

AGTGGTTGCTGGAGCTGTTGTGGGTACCCTGGTTGGACTGGGGTTGCTGGCTGGGCTGGTCC 

TCTTGTACCACCGCCGGGGCAAGGCCCTGGAGGAGCCAGCCAATGATATCAAGGAGGATGCC 

ATTGGTCCCCGGACCCTGCCCTGGCCCAAGAGCTCAGACACAATCTCCAAGAATGGGACCCT 

TTCCTCTGTCACCTCCGCACGAGCCCTCCGGCCACCCCATGGCCCTCCCAGGCCTGGTGCAT 

TGACCCCCACGCCCAGTCTCTCCAGCCAGGCCCTGCCCTCACCAAGACTGCCCACGACAGAT 

GGGGCCCACCCTCAACCAATATCCCCCATCCCTGGTGGGGTTTCTTCCTCTGGCTTGAGCCG 

CATGGGTGCTGTGCCTGTGATGGTGCCTGCCCAGAGTCAAGCTGGCTCTCTGGTATGATGAC 

CCCACCACTCATTGGCTAAAGGATTTGGGGTCTCTCCTTCCTATAAGGGTCACCTCTAGCAC 

AGA3GCCTGAGTCATGGGAAAGAGTCACACTCCTGACCCTTAGTACTCTGCCCCCACCTCTC 

TTTACTGTGGG;i_AAACCATCTCAGTAAGACCTAAGTGTCCAGGAGACAGAAGGAGAAGAGGA 

AGTGGATCTGGAATTGGGAGGAGCCTCCACCCACCCCTGACTCCTCCTTATGAAGCCAGCTG 

CTGAAATTAGCTAGTCACCAAGAGTGAGGGGCAGAGACTTCCAGTCACTGAGTCTCCCAGGC 

CCCCTTGATCTGTACCCCACCCCTATCTAACACCACCCTTGGCTCCCACTCCAGCTCCCTGT 

ATTGATATAACCTGTCAGGCTGGCTTGGTTAGGTTTTACTGGGGCAGAGGATAGGGAATCTC 

TTATTAAAACTAACATGAAATATGTGTTGTTTTCATTTGCAAATTTAAATAAAGATACATAA 
TGTTTGTATGAAAAA 
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FIGURE 17 

X1172/SS .DNA35e3S 

xsubunit: 1 cf 1, 3 90 aa, 1 stop 

><m: 41176, pi: 9.61, NX(3/T): 5 

MISLFGFLVTNLLRFLFLGLSAIAPPSRAQLQLHLPA^TILQAVEGGEVVLPAWYTLHGE^ 
SSSQPWEVPFVMWFFKQKEKEDQVLSYINGVTTSKPGVSLVYSMPSRNLSLRLEGLQEKD 
SGPyS"S\^^^QDKQGKSRGHSIKTLELiNVLVPPAPPSCRLQGV?HVGAKVTLSCQSPRSK 
PAVQYQWDRQLPSFQTFFAPAIJ5VIRGSLSLTKLSSSMAG^/YVCKAHNEVGTAQC?r/TLE 
VSTGPGAAWAGAWGTLVGLGIJ^GLVLLyr3RGKALEE?AOTIKEDAIAFRTLPWPKS 
SDTISKNGTLSSVTSARALR??HGPPRPGALTPT?SLSSQAL?S?RLFTTDGAHPQ?ISP 
I PGG\^S S SGLSRMGAVPVMVPAQSQAGSLV 
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FIGURE IS 

CGCCACCACTGCGGCCACCGCCA 

xMET {trar-s = l-£, dir=f, res = l}> 

ATC-AA-ACGCCTCCCGCTCC7AGTGGTTTTTTCCACTTTGTTGAATTGTTCCTATACTCAA 
AATTGCACCAAGACACCTTGTCTCCCAAATGCAAAATGTGAAATACGCAATGGAATTGAA 
GCCTGCTATTGCAACATGGGATTTTCAGGAAATGGTGTCACAATTTGTGAAGATGATAAT 
GAATGTGGAAATTTAACTCAGTCCTGTGGCGAAAATGCTAATTGCACTAACACAGAAGGA 
AGTTATTATTGTATGTGTGTACCTGGCTTCAGATCCAGCAGTAACCAAGACAGGTTTATC 
ACTAATGATGGAACCGTCTGTATAGAAAATGTGAATGCAAACTGCCATTTAGATAATGTC 
TGTATAGCTGCAAATATTAATAAAACTTTAACAAAAATCAGATCCATAAAAGAACCTGTG 
GCTTTGCTACPAGAAGTCTATAGAAATTCTGTGACAGATCTTTCACCAACAGATATAATT 
ACATATATAGAAATATTAGCTGAATCATCTTCATTACTAGGTTACAAGAACAACACTATC 
TCAGCCAAGGACACCCTTTCTAACTCAACTCTTACTGAATTTGTAAAAACCGTGAATAAT 
TTTGTTCAAAGGGATACATTTGTAGTTTGGGACAAGTTATCTGTGAATCATAGGAGAACA 
CATCTTACAAAACTCATGCACACTGTTGAACAAGCTACTTTAAGGATATCCCAGAGCTTC 
CAAAAGACCi^CAGAGrTTGATACAAATTCAACGGATATAGCTCTCAAAGTTTTCTTTTT^ 
GATTCATATAACATGAAACATATTCATCCTCATATGAATATGGATGGAGACTACATAAAT 
ATATTTCCAAAGAGAAAAGCTGCATATGATTCAAATGGCAATGTTGCAGTTGCATTTTTA 
TATTATAAGAGTATTGGTCCTTTGCTTTCATCATCTGACAACTTCTTATTGAAACCTCAA 
AATTATGATAATTCTGAAGAGGAGGAAAGAGTCATATCTTCAGTAATTTCAGTCTCAATG 
AGCTCAAACCCACCCACATTATATGAACTTGAAAAAATAACATTTACATTAAGTCATCGA 
AAGGTCACAGATAGGTATAGGAGTCTATGTGCATTTTGGAATTACTCACCTGATACCATG 
AATGGCAGCTGGTCTTCAGAGC-GCTGTGAGCTGACATACTCAAATGAGACCCACACCTCA 
TGCCGCTGTAATCACCTGACACATTTTGCAATTTTGATGTCCTCTGGTCCTTCCATTGGT 
ATTAAAGATTATAATATTCTTACAAGGATCACTCAACTAGGAATAATTATTTCACTGATT 
TGTCTTGCCATATGCATTTTTACCTTCTGGTTCTTCAGTGAAATTCAAAGCACCAGGACA 
ACAATTCACAAAAATCTITGCTGTAGCCTATTTCTTGCTGAACTTGTTTTTCTTGTTGGG 
ATCAATACAAATACTAATAAGCTCTTCTGTTCAATCATTGCCGGACTGCTACACTACTTC 
TTTTTAGCTGCTTTTGCATGGATGTGCATTGAAGGCATACATCTCTATCTCATTGTTGTG 
GGTGTCATCTACAACAAGGGATTTTTGCACAAGAATTTTTATATCTTTGGCTATCTAAGC 
CCAGCCGTGGTAGTTGGATTTTCGGCAGCACTAGGATACAGATATTATGGCACAACCAAA 
GTATGTTGGCTTAGCACCGAAAACAACTTTATTTGGAGTTTTATAGGACCAGCATGCCTA 
ATCATTCTTGTTAATCTCTTGGCTTTTGGAGTCATCATATACAAAGTTTTTCGTCACACT 
GCAGGGTTGAAACCAGAAGTTAGTTGCTTTGAGAACATAAGGTCTTGTGCAAGAGGAGCC 
CTCGCTCTTCTGTTCCTTCTCGGCACCACCTGGATCTTTGGGGTTCTCCATGTTGTGCAC 
GCATCAGTGGTTACAGCTTACCTCTTCACAGTCAGCAATGCTTTCCAGGGGATGTTCATT 
TTTTTATTCCTGTGTGTTTTATCTAGAAAGATTCAAGAAGAATATTACAGATTGTTCAAA 
A.ATGTCCCCTGTTGTTTTGGATGTTTAAGGTAAACATAGAGAATGGTGGATAATTACAAC 
TGCACAAAAATAAAAJS.TTCCAAGCTGTGGATGACCAATGTATAAAAATGACTCATCAAAT 
TATCCAATTATTAACTACTAGACAAAAAGTATTTTAAATCAGTTTTTCTGTTTATGCTAT 
AGGAACTGTAGA.TAATAAGGTAAAATTATGTATCATATAGATATACTATGTTTTTCTATG 
TGAAATAGTTCTGTCAAAAATAGTATTGCAGATATTTGGAAAGTAATTGGTTTCTCAGGA 
GTGATATCACTGCACCCAAGGAAAGATTTTCTTTCTAACACGAGAAGTATATGAATGTCC 
TGAAGGAAACCACTGGCTTGATATTTCTGTGACTCGTGTTGCCTTTGAAACTAGTCCCCT 
ACCACCTCGGTAATGAGCTCCATTACAGAAAGTGGAACATAAGAGAATGAAGGGGCAGAA 
T^-TCAAACAGTGAAAAGGGAATGATAAGATGTATTTTGAATGAACTGTTTTTTCTGTAGA 
CTAGCTGAGAAATTGTTGACATAAAATAAAGAATTGAAGAAACACATTTTACCATTTTGT 
GAATTGTTCTGAACTTAAATGTCCACTAAAACAACTTAGACTTCTGTTTGCTAAATCTGT 
TTCTTTTTCTAATATTCTAAAAAAAAAAAAAAGGTTTACCTCCACAAATTGAAAAAAAAA 
AAA-AAAAAAAAAAAAAAAAjyVAAAAAAAAAAAAAAAAAA 
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FIGURE 19 

></uEr/£ecdb2/£St/DNA/Dnaseq£ .min/ss.DNA3 3 0 92 
xsubunit 1 of 1, €9C aa, C step 
xMW: .77825, pi: 7.95, NX (S/Tl : 11 

r-^RLP:.LV^/?STLL^CSYTQNCTKT?CL?NAJCCEIRNGIEACY2NMGFSGNGVTICEDDN 

ECGNLTQSCGENANCTNTEGSYYCMCVPGFRSSSNQDRFITNDGTVCIENVNANCHLDNV 

C I AAKI NKTLTKIRS I KEPVALLQE VTRNS VTDLS PTD 1 1 TYI E IIJ^S S SLLGYKNOT I 

SAKDTLSNSTLTEFVTCTVl^FVQRDTFVVWDKLSVNHRRTHLTKIJ^ 

QKTTEFDTNSTDIALKVFFFDSYNMKHIKPHMNMDGDYINIFPKRKM^^ 

YYKSIGPLLSSSDNFLLKPQNYBNSSEEERVISSVISVSMSSNPPTLYELEKITFTLSHR 

KVTDRYRSLCAFWI^SPDTMNGSWSSEGCELTYSKETHTSCRCNI^IHFAIIJ^ 

IKIDYKILTRITQLGIIISLICIAICIFTFWFFSEIQSTRTTIHKNLCCSLFLAELVFLVG 

iktntnxlfcsiiagli^iyffij^fawmciegihlyliwgviynkgflh:^^ 

PAVWGFSAALGYRYYGTTKVCWLSTENT^IWSFIGPACLIILVNLLAFGVIIYKVFRH^ 
aGLKPEVS CFEN IRS CARGALALLFLLGTTWI FGVLKVVHASVVTAYLFTVSNAFQGMF I 
FLFLC7LSRKIQEEYYRLFKNVPCCFGCLR 
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FIGTIRE 20 

ATAGGAGTCTATGTGGCATrTTGGAATACrCACCTGATACCATGAATGGCAGCTGGrCTTCA 
C-AGGGCTGTGAGCTGACATACTCAAATGAGACCCACACCTCA.TGCCGCTGTAATGACCTGAC 
ACATTT.TGCAATTTTGATGrGCTCTGGTCCTTCCATTGGTATTAAAGATTATAATATTCTTA 
CAAGGATCACTCA-ACTAGGAATAATTArTTCACTGATTTGTCTTGCCATATGCATTTTTACC 
TTCTGGTTCTTCAC-TGAAATTCAAAGCACCAGGA 
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ctcccagcc^-tgaaccrcg3ggccgc?gcgcggtgg3gaggagttccccg;iazi-cccggccgc 
ta;^x-cgaggcctcctcctcccggagatccg;ijlcggcctgggcggggtcaccccggctgggac 
a.ag;^-Agccgccgcctgcctgcccgggcccggggagggggctggggctggggccggaggcggg 
gtgtc-agtgggtgtgtgcggggggcggaggcttgatgcagtcccgataagaaatgctcgggt 

GTCTTGGGCACCTACCCGTGGGGGCCGTAA-GGCGCTACTATATAACGCTGCCGGCCCTGAGC 

cgccgagccgtccgagcaggagcgctgcgtccaggatctagggcacgaccatcccaacccgg 
cattcacagccccgcagcgcatccggtcgccggccaggttccgcaccccatcgccggagctg 

CGCCGAGAGCCCCAGGGAGGTGCCATGCGGAGCGGGTGTGTGGTGGTCCACGTATGGATCCT 

ggccggcctctggctggccgtggccgggcgccccctcgccttctcggacgcggggccccacg 
tgcactacggctggggcgaccccatccgcctgcggcacctgtacacctccggcccccacggg 
ctctccagctgcttcctgcgcatccgtgccgacggcgtcgtggactgcgcgcggggccagag 
cgcgcacagtttggtggagatcaaggcagtcgctctgcggaccgtggccatoiagggcgtgc 
acagcgtgcggtacctctgcatgggcgccgacggcaagatgcaggggctgcttcagtactcg 
gaggaagactgtgctttcgaggaggagatccgcccagatggctacaatgtgtaccgatccga 
gaagcaccgcctcccggtctccctgagcagtgccaaacagcggcagctgtacaagaacagag 
gctttcttccactctctcatttcctgcccatgctgcccatggtcccagaggagcctgaggac 
ctcaggggccacttggaatctgacatgttctcttcgcccctggagaccgacagcatggaccc 
atttgggcttgtcaccggactggaggccgtgaggagtcccagctttgagaagtaactgagac 
catgcccgggcctcttcactgctgccaggggctgtggtacctgcagcgtgggggacgtgctt 
ctacaagaacagtcctgagtccacgttctgtttagctttaggaagaaacatctagaagttgt 
acatattcagagttttccattggcagtgccagtttctagccaatagacttgtctgatcataa 
cattggaagccttgtacttggcccagctgttgcctgggccccccattctgctccctcgaggt 
tgctggacaagctgctgcactgtctcagttctgcttgaatacctccatcgatggggaactca 

C T T C C T T T GGAAAAAT T CT T ATGTCAAGCT GAAATTCTCTAATTT TTCTCATCACTTCCCCA 
GGAGCAGCCAGAAGACAGGCAGTAGTTTTAATTTCAGGAACAGGTGATCCACTCTGTAAAAC 
AGCAGGTAAJi.TTTCACTCAACCCCATGTGGGAATTGATCTATATCTCTACTTCCAGGGACCA 

tttgcccttcccaaatccctccaggccagaactgagtggagcaggcatggcccaccaggctt 
cagaa.gtaggggaagcctggagccccactccagccctgggacaacttgagaattccccctga 
ggccagttc?gtcatggatgcrg?cctgagaataacttgctgtcccggtgtcacctgcttcc 

ATCTCCCAGCCCACCAGCCCTGTGCCCACCTCACATGCCTCCCCATGGATTGGGGCCTCCCA 
GGCCCCCCACCTTATGTCAACCTGCACTTCTTGTTCAAAAATCAGGAAAAGAAAAGATTTGA 
AGACCCCA^i.GTCTTGTCAATAACTTGCTGTGTGGAAGCAGCGGGGGAAGACCTAGAACCCTT 
TCCCGAGCACTTGGTTTTCCAACATGATATTTATGAGTAATTTATTTTGATATGTACATCTC 
TTA7T,TCTTACATTATTTATGCCCCC;iAATTATATTTATGTATGTAAGTGAGGTTTGTTTT 
GTATATTAAAATGGAGTTTGTTTGT 
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FIGimE 22 
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FTGTJRF 2^ 

CCCAGA^.GTTCAAGGGCCCCCGGCCTCCTGCGCTCCTGCCGCCGGGACCCTCGACCTCCTCA 
GAGCAGCCGGCTGCCGCCCCGGGAAGATGGCGAGGAGGAGCCGCCACCGCCTCCTCCTGCTG 
CTGCTGCGGTACCTGGTGGTCGCCCTGGGCTATCATAAGGCCTATGGGTTTTCTGCCCCAAA 
AGACCA;iXAAGTAGTCACAGCAGTAGAGTACCAAGAGGCTATT?TAGCCTGCAAAACCCCAA 
AGAAGACTGTTTCCTCCAGATTAGAGTGGAAGAAACTGGGTCGGAGTGTCTCCTTTGTCTAC 
TATCAACAGACTCTTCAAGGTGATTTTAAAAATCGAGCTGAGATGATAGATTTCAATATCCG 
GATCAAAAATGTGACAAGAAGTGATGCGGGGAAATATCGTTGTGAAGTTAGTGCCCCATCTG 
AGCA-A.GGCCAAAACCTGGAAGAGGATACAGTCACTCTGGAAGTATTAGTGGCTCCAGCAGTT 
CGATCATGTGAAGrACCCTCTTCTGCTCTGAGTGGAACTGTGGTAGAGCTACGATGTCAAGA 
CAAAG^J^GGGAATCCAGCTCCTGAATACACATGGTTTAAGGATGGCATCCGTTTGCTAGAAA 
ATCCCAGAGTTGGGTCCCAJU1.GCACCAACAGCTCATACACAATGAATACAAAAACTGGAACT 
CTGCAATTTAATACTGTTTCCAAACTGGACACTGGAGAATATTCCTGTGAAGCCCGCAATTC 
TGTTGGATATCGCAGGTGTCCTGGGAAACGAATGCAAGTAGATGATCTCAACATAAGTGGCA 
TCA.TAGCAGCCGTAGTAGTTGTGGCCTTAGTGATTTCCGTTTGTGGCCTTGGTGTATGCTAT 
GCTCAGAGGAAAGGCTACTTTTCAAAAGAAACCTCCTTCCAGAAGAGTAATTCTTCATCTAA 
AGCCACGACAJ^TGAGTGAAAATGTGCAGTGGCTCACGCCTGTAATCCCAGCACTTTGGAAGG 
CCGCGGCGGGCGGATCACGAGGTCAGGAGTTCTAGACCAGTCTGGCCAATATGGTGAAACCC 
CATCTCTACTAAAATACAAAAATTAGCTGGGCATGGTGGCATGTGCCTGCAGTTCCAGCTGC 
TTGGGAGACAGGAGAATCACTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCTGAGATCACGC 
CACTGCAGTCCAGCCTGGGTAACAGAGCAAGATTCCATCTCAAAAAATAAAATAAATAAATA 
A.:iTAAATAGTGGTTTTTACCTGTAGAATTCTTACAATAAATATAGCTTGATATTC 
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FIGT JRE 2 4 

><ss .DNA35€3e 

xEubunit 1 cf 1, 312 aa , 1 stop 
xMW: 34554, pi: S.39, NX (S/T) : 4 

FJLORSRHRLLLLLLRYLWALGYHKAYGFSAPKDOQVVTAVEYQEAIIJ^CKTPKKT^ 

LEVnGCLGRSVSFVTYQQTI^GDFKNRAEMIDFNIRIKWniSDAGKYR 

LEEDTVTLE^\;LVA?AVPSCEV?£SALSGTVVELRCQDKEGNPAPEYTWFKDGIRLLENPR 

lgs q s tns s ytmntktgtlqfntvs kldtge ys cearns vgyrrcpgkrmqvddlni sgi 
iaa'aa/-valvisvcg:.gvcyaqrkgyfsketsfqksnssskattmsenvqwltpvipalw 
kaaaggsrgqef 
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FTGTJRE 25 

gacatcggaggtgggctagcactgaaactgcttttcaagacgaggaagaggaggagaa;igag 

AAAGAAGAGGAJIGATGTTGGGCAJ^CATTTATTTAACATGCTCCACAGCCCGGACCCTGGCAT 
CATGCTGCTATTCCTGCAAJiTACTGAA.GAJ\GCATGGGATTTA.zATATTTTACTTCTAAATAA 
ATGAATTACTCAA-CTCCTATGACCATCTATACATACTCCACCTTCAAAAAGTACATCAATA 
TTATATCATTAAGGAAATAGTAACCTTCTCTTCTCCAATATGCATGACATTTTTGGACAATG 
CAATTGTGGCACTGGCACTTATTTCAGTGAAGAAAAACTTTGTGGTTCTATGGCATTCATCA 
TTTGACAAATGCAAGCATCTTCCTTATCAATCAGCTCCTATTGAACTTACTAGCACTGACTG 
TGGAJi.TCCTTAAGGGCCCATTACATTTCTGAAGAAGAAAGCTAAGATGAAGGACATGCCACT 
CCGAATTCATGTGCTACTTGGCCTAGCTATCACTACACTAGTACAAGCTGTAGATAAAAAAG 
TGGATTGTCCACGGTTATGTACGTGTGAAATCAGGCCTTGGTTTACACCCAGATCC;iTTTAT 
ATGGAAGCATCTACAGTGGATTGTAATGATTTAGGTCTTTTAACTTTCCCAGCCAGATTGCC 
AGCTAACACACAGATTCTTCTCCTACAGACTAACAATATTGCAAAAATTGAATACTCCACAG 
ACTTTCCAGTAAACCTTACTGGCCTGGATTTATCTCAAAACAATTTATCTTCAGTCACCAAT 
ATTAATGTAAAAAAGATGCCTCAGCTCCTTTCTGTGTACCTAGAGGAAAACAAACTTACTGA 
ACTGCCTGAAAAATGTCTGTCCGAJICTGAGCAACTTACAAGAACTCTATATTAATCACAACT 
TGCTTTCTACAATTTCACCTGGAGCCTTTATTGGGCTACATAATCTTCTTCGACTTCATCTC 
AATTCAAATAGATTGCAGATGATCAACAGTAAGTGGTTTGATGCTCTTCCAAATCTAGAGAT 
TCTGATGATTGGGGAAAATCCAATTATCAGAATCAAAGACATGAACTTTAAGCCTCTTATCA 
ATCTTCGCAGCCTGGTTATAGCTGGTATAAACCTCACAGAAATACCAGATAACGCCTTGGTT 
GGACTGGAAAACTTAGAAAGCATCTCTTTTTACGATAACAGGCTTATTAAAGTACCCCATGT 
TGCTCTTCAAAAAGTTGTAAATCTCAAATTTTTGGATCTAAATAAAAATCCTAITAATAGAA 
TACGAAGGGGTGATTTTAGCAATATGCTACACTTAAAAGAGTTGGGGATAAATAATATGCCT 
GAGCTGATTTCCATCGATAGTCTTGCTGTGGATAACCTGCCAGATTTAAGAAAAATAGAAGC 
TACTAACAACCCTAGATTGTCTTACATTCACCCCAATGCATTTTTCAGACTCCCCAAGCTGG 
AATCACTCATGCTGAACAGCAATGCTCTCAGTGCCCTGTACCATGGTACCATTGAGTCTCTG 
CCAAACCTCAAGGAAATCAGCATACACAGTAACCCCATCAGGTGTGACTGTGTCATCCGTTG 
GATGAACATGAACAAAACCAACATTCGATTCATGGAGCCAGATTCACTGTTTTGCGTGGACC 
CACCTGAATTCCAAGGTCAGAATGTTCGGCAAGTGCATTTCAGGGACATGATGGAAATTTG-^ 
CTCCCTCTTATAGCTCCTGAGAGCTTTCCTTCTAATCTAAATGTAGAAGCTGGGAGCTATGT 
TTCCTTTCACTGTAGAGCTACTGCAGAACCACAGCCTGAAATCTACTGGATAACACCTTCTG 
G . CAAAAACTCTTGCCTAJITACCCTGACAGACAAGTTCTATGTCCATTCTGAGGGAACACTA 
GATATAAATGGCGTAACTCCCAAAGAAGGGGGTTTATATACTTGTATAGCAACTAACCTAG'T' 
TG G CG CTGACTTGAAGTCTGTTATG ATCAAAGTGGATGGATCTTTTCCACAAGATAACAATG 
G CT CTTTGTIATATTAJ^AATAAGAGATATTCAGGCCAATTCAGTTTTGGTGTCCTGGAAAGCA 
AGTTCTAA-AATTCTCAAATCTAGTGTTAAATGGACAGCCTTTGTCAAGACXaAAAATTCTCA 
.GGTGCGCAAAGTGCTCGAATACCATCTGATGTCAAGGTATATAATCTTACTCATCTGAATC 
CATC/^CTGAGTATAJIAATTTGTATTGATATTCCCACCATCTATCAGAAAAACAGAAAAAAA 
TGTGTAAJ^TGTCACCACCAAAGGTTTGCACCCTGATCAAAAAGAGTATGAAAAGAATAATAC 
CACAACACTTATGGCCTGTCTTGGAGGCCTTCTGGGGATTATTGGTGTGATATGTCTTATCA 
GCTGCCTCTCTCCAGAAATGAACTGTGATGGTGGACACAGCTATGTGAGGAATTACTTACAG 
AA^.CCAACCTTTGCA.TTAGGTGAGCTTTATCCTCCTCTGATAAATCTCTGGGAAGCAGGAAA 
AG^J^J^AAGTACATC?.CTGAAAGTAAAAGCAACTGrrATAGGTTTACCAACAAATATGTCCTA^ 
AAACCACCAAGGAAACCTACTCCAAAAATGAAC ^-^^^i^ i i aa 
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FTGTJRF 26 

M:<C>IFLRi:^;LlGLAirTLVQAVDKK:vT)C?RLCTCEIRFW?TPRSIYME^STVDC^ 
LLTFFARLFPJ^TQILLLQTNNIAKIEYSTDFPVKLTGLDLSQl^LSSVTO 

lsvyleenklte:.?ekclselsnlqelyinhnllstispgafiglhnllrlhlnsnrlq 

MINSKWFDALPNLEILMIGENFIIRIKDMN7KPLINLRSLVIAGINLTEIPDNALVGLE 
NLESISFYDNRLIKVPHVALQKVVl^KFI^LJrKNPINRIRRGDFSNMLHLK^ 
ELISIDSLAVDKLPDIRKIEATNNPRLSYIHPNAFFRLPKLESLKLNSNALSALYHGTI 
E S LPNLKE I S I HS N? I RCDCV I RWMNMNKTITIRFMEPDSLFCVD? PE FQGQNVRQ VHFR 
DMMEICLPLIAPESFPSNLNVEAGSYVSFHCRATAEPQPEIYWITPSGQKLLFNTLTDK 
F^^^HSEGTLDINGVTPKEGGLYTCIATNLVGADLKSVMIKVDGSFPQDNNGSLNIKIRD 
I QANS VLVS WKAS S K I LKS S VKWTAFVKTENSKAAQS AR I PSD VKVYNLTHLNP STEYK 
I C I D I PTI YQKNRKKCW^TKGLHPDQKEYEKNNTTTLMACLGGLLGI IGVI CL I SCL 
SFEr^ICDGGHSYVRNYLQKPTFALGELYPPLINLWEAGKEXSTSLKVKATVIGLPTNMS 
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HGURE 27 

G C C C C-G GACT GG CGCAAGGTGCCCAAGCAAGGAAAGAAATA.:^TGAAGAGACACATGTGTTAG 
CTGCAGCCTTTTGAAACACGCAAGAAGGAAATCAATAGTGTGGACAGGGCTGGAACCTTTAC 
CAGGCTTGTTGGAGTAGATGAGGAATC-GGCTCGTGATTATGCTGAGATTCCAGCATGAATCT 
GGTAGACCTGTGGTTAACCCGTTCCCTCTCCATGTGTCTCCTCCTACAAAGTTTTGTTCTTA 
TGATACTGTGCTTTCATTCTGCCAGTATGTGTCCCAAGGGCTGTCTTTGTTCTTCCTCTGGG 
GGTTTAAATGTCACCTGTAGCAATGCAAATCTCAAGGAAATACCTAGAGATCTTCCTCCTGA 
AACAGTCTTACTGTATCTGGACTCCAATCAGATCACATCTATTCCCAATGAAATTTTTAAGG 
ACCTCCATCAACTGAGAGTTCTCAACCTGTCCAAAAATGGCATTGAGTTTATCGATGAGCAT 
GCGTTCAAAGGAGTAGCTGAAACCTTGCAGACTCTGGACTTGTCCGACAATCGGATTCAAAG 
TGTGCACAAAAATGCCTTCAATAJ^CCTGAAGGCCAGGGCCAGAATTGCCAACAACCCCTGGC 
AGTGCGACTGTACTCTACAGCAAGTTCTGAGGAGCATGGCGTCCAATCATGAGACAGCCCAC 
AACGTGATCTGTAAAACGTCCGTGTTGGATGAACATGCTGGCAGACCATTCCTCAATGCTGC 
CAACGACGCTGACCTTTGTAACCTCCCTAAAAAAACTACCGATTATGCCATGCTGGTCACCA 
TGTTTGGCTGGTTCACTATGGTGATCTCATATGTGGTATATTATGTGAGGCAAAATCAGGAG 
GATGCCCGGAGACACCTCGAATACTTGAAATCCCTGCCAAGCAGGCAGAAGAAAGCAGATGA 
ACCTGATGATATTAGCACTGTGGTATAGTGTCCAAACTGACTGTCATTGAGAAAGAAAGAAA 
GTAGTTTGCGATTGCAGTAGAAATAAGTGGTTTACTTCTCCCATCCATTGTAAACATTTGAA 
ACTTTGTATTTCAGTTTTTTTTGAATTATGCCACTGCTGAACTTTTAACAAACACTACAACA 
TAAATAATTTGAGTTTAGGTGATCCACCCCTTAATTGTACCCCCGATGGTATATTTCTGAGT 
AAGCTACTATCTGAACATTAGTTAGATCCATCTCACTATTTAATAATGAAATTTATTTTTTT 
AATTTAAA-Zi.GCAAATAAAAGCTTAACTTTGAACCATGGGAAAAAAAAAAAAAAAAAAAAAAAACA 
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FIGURE 28 

xss .CNA3 5 08 5 

xs-obunit 1 of 1, 25 5 aa, 1 step 
xMW: 25275, pi: €.92, ITK{S/T}: 2 

MNLV-DLWITRSLSMCLLLOSFVLMILCFKSASMCPKGCLCSSSGGLNVTCSNAICLKZIPR 
DLP?ETVLLYLDSKQITSIFNEIFKDLHQLRV1JCLSKNGIEFIDEKA?KGVA£TLQTLDL 
SDNTa QSV"riKNAF^:l^KARARI ANl^PWHCDCTLQQVLRSMAS^TIET^^^^ CKTSVLDEH 
A2-RP F LNAA^^3ABL CNLP KKT TD Y AMLVTMrGWFTMV I S YVVYYVRQNQEDARRHLEYLK 
SLFSRQKKACE?DDISTW 
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FIGURE 29 

A C CGA G C C GAG C G GA C CG AAG G GG CG C C CG AG ATG CAGG TGAG CAJ^.G AGGA.TG CTGG CGG GG 
GGCGTGAGGAGGATGCCCAGCCCCGTCGTGGCCTGCTGGCAGCCCATCCTCCTGCTGGTGGT 
GGGCTCAGTGCTGTGAGGGTCGGCGACGC-GCTGCCCGCCCCGCTGCGAGTGCTCCGGCGAGG 
ACCGCGCTGTGCTGTGCCACCGCAAGTGCTTTGTGGCAGTCCCCGAGGGCATCCCCACCG2.G 
ACGCGGCTGCTGC-ACCTAGGCAAGAACCGCATCAA-AACGCTCAACCAGGACGAGTrCGGCAG 
CTTCGCGCACCTGGAGGAGCTGGAGGTCAACGAGAACATCGTGAGCGCCGTGGAGCCCGGCG 
CC7TCAACAACCTCTTCAACCTCCGGACGCTGGGTCTCCGCAGCAACCGCCTGAAGCTCATC 
CCGCTAGGCGTCTTCACTGGCCTCAGCAACCTGACCAAGCAGGACATCAGCGAGAACJ^GAT 
CGTTATCCTACTGGACTACATGTTTCAGGACCTGTACAACCTCAAGTCACTGGAGGTTGGCG 
ACAATGACCTCGTCTACATCTCTCACCGCGCCTTCAGCGGCCTCAACAGGCTGGAGCAGCTG 
ACGCTGGAGAA^TGCAACCTGACCTCCATCCCCACCGAGGCGCTGTCCCACCTGCACGGCCT 
CATCGTCCTGAGGCTCCGGCACCTCAACATCAATGCCATCCGGaACTACTCCTTCAAGAGGC 
TGTACCGACTCAAGGTCTTGGAGATCTCCCACTGGCCCTACTTGGACACCATGACACCCAAC 
TGCCTCTACGGCCTCAACCTGACGTCCCTGTCCATCACACACTGCAATCTGACCGCTGTGCC 
CTACCTGGCCGTCCGCCACCTAGTCTATCTCCGCTTCCTCAACCTCTCCTACAACCCCATCA 
GCACCATTGAGGGGTCCATGTTGCATGAGCTGCTCCGGCTGCAGGAGATCCAGCTGGTGGGC 
GGGCAGCTGGCCGTGGTGGAGCCCTATGCCTTCCGCGGCCTCAACTACCTGCGCGTGCTCAA 
TGTCTCTGGCAACCAGCTGACCACACTGGAGGAATCAGTCTTCCACTCGGTGGGCAACCTGG 
AGACACTCATCCTGGACTCCAACCCGCTGGCCTGCGACTGTCGGCTCCTGTGGGTGTTCCGG 
CGCCGCTGGCGGCTCAACTTCT^CCGGCAGCAGCCCACGTGCGCCACGCCCGAGTTTGTCCA 
:;G^C^GGAGTTCAAGGACTTCCCTGATGtGCTACTGCCCAACTACTTCACCTGCCGCCGCG 
CCCGCATCCGGGACCGCAAGGCCCAGCAGGTGTTTGTGGACGAGGGCCACACGGTGCAGTTT 
GTGTGCCGGGCCGATGGCGACCCGCCGCCCGCCATCCTCTGGCTCTCACCCCGAAAGCACCT 
GGTCTCAGCCAAGAGCAATGGGCGGCTCACAGTCTTCCCTGATGGCACGCTGGAGGTGCGCT 
ACGCCCAGGTACAGGACAACGGCACGTACCTGTGCATCGCGGCCAACGCGGGCGGCAACGAC 
TCCATGCCCGCCCACCTGCATG7GCGCAGCTACTCGCCCGACTGGCCCCATCAGCCCAACAA 
GA.CCTTCGCTTTCATCTCCAACCAGCCGGGCGAGGGAGAGGCCAACAGCACCCGCGCCACTG 
TGCCTTTCCCCTTCGACATCAAGACCCTCATCATCGCCACCACCATGGGCTTCATCTCTTTC 
CTGGGCGTCGTCCTCTTCTGCCTGGTGCTGCTGTTTCTCTGGAGCCGGGGCAAGGGCAACAC 
AAAGCACAACATCGAGATCGAGTATGTGCCCCGAAAGTCGGACGCAGGCATCAGCTCCGCCG 
ACGCGCCCCGCAAGTTCAACATGAAGATGATATGAGGCCGGGGCGGGGGGCAGGGACCCCCG 
GGCGGCCGGGCAGGGGAAGGGGCCTGGTCGCCACCTGCTCACTCTCCAGTCCTTCCCACCTC 
CTCCCTACCCTTCTACACACGTTCTCTTTCTCCCTCCCGCCTCCGTCCCCTGCTGCCCCCCG 
CCAGCCCTCACCACCTGCCCTCC7TCTACCAGGACCTCAGAAGCCCAGACCTGGGGACCCCA 
CCTACACAGGGGCATTGACAGACTGGAGTTGAAAGCCGACGAACCGACACGCGGCAGAGTCA 
ATAJ^TTCAJ^.TAAAAAJi.GTTACGAACTTTCTCTGTAACTTGGGTTTCAATAATTATGGATTTT 
TATGAAAACTTGAAATAATAAAAAGAGAAAAAAACTAJ^AAAAAAAAAAAAAAAAAAAA 
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FTOTJRE 30 

xsubuni" 1 of 1, 52C aa, i step 
><m: .69536, pi: 8.54, NXCS/T): 10 

KQVSKRMLAGGTn^I^SMPSPLIJICWQPILLLVLGSVLSGSATGCPPRCSCSAQDRA'^ 

C?VAVPSG I PTSTPXLDLGKNR- KTIJJQDEFASFFHLEELELNENI VSAVEPGAFNTujFN 

LRTLGLRSNRLKLIPLGVFTGLSNLTKQDISENKIVILLDYNFQDLYNLKSLEVGDNDLV 

YISPaAFSGIxNSLEQLTLEKCNLTSIPTEALSHLHGLJVLRLRHLNINAIRDYSFKRLYR 

LKVLE ISnr^PYLDIMTPNCLYGl^LTSLS ITHCNLTAVPYI^VRKLVYLRFLNLSYNPIS 

TIEGS^C:iKELLRLQEIQLVGGQI^Vv'EPYAFRGL^n'LRVLNVSGNQLTTLEES^^ 

LETLZLDSNFIJ^CDCRLLWVFRRRWRLNFNRQCPTCATPEFVQGKEFKDFPDVLLPNYFT 

CRRARI RDRKAQQ VFVDEGHTVQFVCRADGD ? P P AI LWLS PRKKLVS AKSNGRLTVFPDG 

TLEVRYAQVQDNGTYLCIAANAGGNDSMPAHLHVRSYSPDWPHQPNKTFAFISNQPGEGE 

ANSTRATVPFFFBIKTLIIATTMGFISFLGWLFCLVLLFLWSRGKGNTKHNIEIEYVPR 

KSDAGISSADAFRKFNMKMI 



31 / 128 



wo 99/143:8 



PCT/US98/19330 



FTGIJRE 31 

CCCACGCGTCCGCACCTCGGCCCCGGGCTCCGAAGCGGCTCGGGGGCGCCCTTTCGGTCAAC 

ATCGTAGTCCACCCCCTCCCCATCCCCAGCCCCCGGGGATTCAGGCTCGCCAGCGCCCAGCC 

AGGGAGCCGGCCGGGAAGGGCGATGGGGGCCCCAGCCGCCTCGCTCCTGCTCCTGCTGCTGC 

TGTTCGCCTGCTGCTGGGGGCCCGGCGGGGCCAACCTCTCCCAGGACGACAGCCAGCCGTGG 

ACATCTGATGAAACAGTGGTGGCTGGTGGCACCGTGGTGCTCAAGTGCCAAGTGAAAGATCA 

CGAGGACTCATCCCTGCAATGGTCTAAGCCTGCTCAGCAGACrCTCTACTTTGGGGAGAAGA 

GAGCCCTTCGAGATAATCGAATTCAGCTGGTTACCTCTACGCCCCACGAGCTCAGCATCAGC 

ATCAGCAJlTGTGGCCCTGGCAGACGAGGGCGAGTACACCTGCTCAATCTTCACTATGCCTGT 

GCGAACTGCCAAGTCCCTCGTCACTGTGCTAGGAATTCCACAGAAGCCCATCATCACTGGTT 

ATAJATCTTCATTACGGGAAJiAAGACACAGCCACCCTAAACTGTCAGTCTTCTGGGAGCAAG 

CCTGCAGCCCGGC7CACCTGGAGAAAGGGTGACCAAGAACTCCACGGAGAACCAACCCGCAT 

ACAGGA-AGA.TCCCAATGGTAAAACCTTCACTGTCAGCAGCTCGGTGACATTCCAGGTTACCC 

GGGAGGATGATGGGGCGAGCATCGTGTGCTCTGTGAACCATGAATCTCTAAAGGGAGCTGAC 

AGATCCACCTCTCAACGCATTGAAGTTTTATACACAGCAACTGCGATGATTAGGCCAGACCC 

TCCCCATCCTCGTGAGGGCCAGAAGCTGTTGCTACACTGTGAGGGTCGCGGCAATCCAGTCC 

CCCAGCAGTACCTATGGGAGAAGGAGGGCAGTGTGCCACCCCTGAAGATGACCCAGGAGAGT 

GCCCTGATCTTCCCTTTCCTCAACAAGAGTGACAGTGGCACCTACGGCTGCACAGCCACCAG 

CAACATGGGGAGCTACAAGGCCTACTACACCCTCAATGTTAATGACCCCAGTCCGGTGCCCT 

CCTCCTCCAGCACCTACCACGCCATCATCGGTGGGATCGTGGCTTTCATTGTCTTCCTGCTG 

CTCATCATGCTCATCTTCCTTGGCCACTACTTGATCCGGCACAAAGGAACCTACCTGACACA 

TGAGGCAAAAGGCTCCGACGATGCTCCAGACGCGGACACGGCCATCATCAATGCAGAAGGCG 

GGCAGTCAGGAGGGGACGACAAGAAGGAATA.TTTCATCTAGAGGCGCCTGCCCACTTCCTGC 

GCCCCCCAGGGGCCCTGTGGGGACTGCTGGGGCCGTCACCAACCCGGACTTGTACAGAGCAA 

CCGCAGGGCCGCCCCTCCCGCTTGCTCCCCAGCCCACCCACCCCCCTGTACAGAATGTCTGC 

TTTGGGTGCGGTTTTGTACTCGGTTTGGAATGGGGAGGGAGGAGGGCGGGGGGAGGGGAGGG 

TTGCCCTCAGCCCTTTCCGTGGCTTCTCTGCATTTGGGTTATTATTATTTTTGTAACAATCC 

CAAJlTCAAATCTGTCTCCAGGCTGGAGAGGCAGGAGCCCTGGGGTGAGAAAAGCAAAAAACA 
AJiCAJUiAAACA 
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FTGTJRR 32 

MGAPAA5LLLLLLLFACCWAPGGANLSQDDSQPWTSDETVVAG3TVVLKCQVKDHEDS5 
LQWSNrAQCTLYrGZKRALRDN^lIQLVTSTPHELSISISNVAIJiDEGEYTCSIFTMPVR 
TAKSlVTVLGIFQK?IITGyKSSLREKr!TATLNCQSSGSKPAARI.TWRKGDQELHGEPT 
R I Q ED PNGKTFTVS S SVTFQVTREDDGAS I VCS VNHESLKGADRSTS QR I EVLYTPTAM 
IRPDPPHPREGQKLLLHCEGRGNPVPQQYLWEKEGSVPPLKNTQESALIFPFLNKSDSG 
T YG CTAT SNMG S YKAYYTLNVNDP S P V? S S S S TYHAI I GG I VAF I VFLLL I ML I F LGHY 
LIRHKGTYLTHEAKGSDDAPDADTAIIKAEGQQSGGDDKKEYFI 
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FIGURE 33 

GGGGGTTAGC-C-AGGAAGC-AATCCACCCCCACCCCCCaiLikACCC?TTTC'rTC"CCTTTCCTGG 
CTTCGC-ACATTGGAGCACTAAATGAACTTGAATTGTGTCTGTGGCGAGCAGGA7GG7CGCTG 
TTACTTTGTGATGAGATCGGGGATC-AATTGCTCGCTTTAAAAATGC^GCT'-'^GGATTCTGTT 
GCTGGAGACGTCTCTTTGTTTTGCCGCTGGAAACGTTACAGC-GGACGTTTGCAAAGAGAAGA 
TCTGTTCCTGCAATGAGATAGAAGGGGACCTACACGTAGACTGTGAAAAAAAGGGCTTCACA 
AGTCTGCAGCGTTTCACTGCCCCGA.CTTCCCAGTTTTACCAT7TATTTCTGCATGGCAATTC 
CCTCACTCGACTTTTCCCTAJi.TGAGTTCGCTAACTTTTATAATGCGGTTAGTTTGCACATGG 
AAAACAATGGCTTGCATGAAATCGTTCCGGGGGCTTTTCTGGGGCTGCAGCTGGTGAAAAGG 
CTGCACATCAACAACAACAAGATCAAGTCTTTTCGAAAGCAGACTTTTCTGGGGCTGGACGA 
TCTGGAATATCTCCAGGCTGATTTTAATTTATTACGAGATATAGACCCGGGGGCCTTCCAGG 
ACTTGAACAAGCTGGAGGTGCTCATTTTAAATGACAATCTCATCAGCACCCTACCTGCCAAC 
GTGTTCCAGTATGTGCCCATCACCCACCTCGACCTCCGGGGTAACAGGCTGAAAACGCTGCC 
CTATGAGGAGGTCT7GGAGCAAATCCCTGGTATTGCGGAGATCCTGCTAGAGGATAACCCTT 
GGGACTGCACC7G7GATC7GCTCTCCC7GAAAGAATGGCTGGAAAACATTCCCAJVGAATGCC 
CTGA7CGGCCGAG7GGTC7GCGAAGCCCCCACCAGAC7GCAGGGTAAAGACCTCAA7GAAAC 
CACCGAACAGGAC77GTG7CCTTTGAAAAACCGAGTGGATTC7AGTCTCCCGGCGCCCCCTG 
CCCAAGAAGAGACC777GCTCCTGGACCCCTGCCAACTCCTTTCAAGACAAA7GGGCAAGAG 
GATCA7GCCACACCAGGG7CTGCTCCAJUVCGGAGGTACAAAGATCCCAGGCAACTGGCAGAT 
CAAAA7CAGACCCACAGCAGCGA7AGCGACGGGTAGCTCCAGGAACAAACCC7TAGCTAACA 
G7T7ACCCTGCCC7GGGGGC7GCAGCTGCGACCACA7CCCAGGGTCGGGT7TAAAGATGAAC 
7GCAACAACAGGAACGTGAGCAGCTTGGC7GATTTGAAGCCCAAGCTC7CTAACGTGCAGGA 
GCT7TTCC7ACGAGA7AACAAGA7CCACAGCA7CCGAAAATCGCACTTTGTGGATTACAAGA 
ACCTCA77CTGT7GGATCTGGGCAACAATAACATCGCTACTGTAGAGAACAACACTTTCAAG 
AACCT777GGACCTCAGG7GGCTATACA7GGATAGCAATTACCTGGACACGCTGTCCCGGGA 
GAAAT7CGCGGGGCTG2AAAACCTAGAGTACC7GAACGTGGAG7ACAACGCTATCCAGCTCA 
TCCTCCCGGGCACT77CAATGCCATGCCCAAACTGAGGATCCTCATTCTCAACAACAACC7G 
C7GAGG7CCC7GCC7GTGGACG7G7TCGC7GGGGTCTCGCTC7CTAAACTCAGCCTGCACAA 
CAATTACTTCA7G7ACCTCCCGG7GGCAGGGGTGC7GGACCAG7TAACCTCCATCA7CCAGA 
7AGACCTCCACGGAAACCCC7C-GGAGTGC7CCTGCACAA7TG7GCCTTTCAAGCAG7GGGCA 
GA;i.CGC77GGG77CCGAAGTGCTGATGAGCGACC7CAAG7GTGAGACGCCGGTGAACT7CTT 
7AGAAAGGA777CA7GCTCC7C7CCAA7GACGAGATCTGCCCTCAGCTGTACGCTAGGATCT 
CGCCCACGTTAAC77CGCACAG7AAAAACAGCACTGGGTTGGCGGAGACCGGGACGCACTCC 
AACTCCTACCTAGACACCAGCAGGGTG7CCA7C7CGGTGTTGG7CCCGGGACTGCTGCTGGT 
GT77G7CACC7CCGCC77CACCG7GGTGGGCATGC7CG7GT7TATCCTGAGGAACCGAAAGC 
GG7CCAAGAGACGAGATGCCAACTCCTCCGCG7CCGAGA7TAA7TCCCTACAGACAGTCTGT 
GAC7C7TCCTACTGGCACAA.7GGGCC7TACAACGCAGA7GGGGCCCACAGAGTGTATGAC7G 
7GGCTC7CAC7CGC7CTCAGAC7AAGACCCCAACCCCAA7AGGGGAGGGCAGAGGGAAGGCG 
A7ACA7CCTTCCCCACCGCAC-GCACCCCGGGGGCTGGAGGGGCGTG7ACCCAAATCCCCGCG 
CCA7CAGCC7GGA7GGGCA7AAG7AGA7AAATAAC7GTGAGC7CGCACAACCGAAAGGGCCT 
GACCCC77AC77AGCTCCC7CCTTGAAACAAAGAGCAGACTGTGGAGAGCTGGGAGAGCGCA 
GCCAGC7CGG7CT77GCTGAGAGCCCCT77TGACAGAAAGCCCAGCACGACCC7GCTGGAAG 
AACTGACAG7GCCC7GGCCC7CGGCCCCGGGGCCTGTGGGG77GGATGCCGCGG7TCTATAC 
A7ATA7ACA7ATATCCACA7C7A7A7AC-AGAGATAGATA7CTATTTTTCCCCTGTGGATTAG 

CCCCG7GA7GGC7CCC7GTTGGCTACGCAGGGATGGGCAG7TGCACGAAGGCA7GAA7G7AT 
7G7AAATAAGTAACTT7GAC7TC7GAC 
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FTQTTRE 34 

MLL^.^ I LLL^TSLCFAAGirvTGDVCKZKI CS CNE 1 EGDLKVD CEKKGrTSLQRFTAPT S Q 

?Y:-XFlZ":GN3LTRLrPNE?AK?TKAVSVr:KENKGLHEIVPGAFLGL!;^LVK^ 

KS FRKQT F LGLDDLEYLQ^iJD ENLLRD I DFG AFQDLNKLEVLI LNDNLI STLP ANV? 

P I THLDLRGNRLKTL? YSE VLEQ I PGI A£ I LLEDNPWDCTCDLLSLKEWLENI PKNAL I 

GRWCEAPTRLQGIOLNSTTSQDLCPLKNRVDSSLPAPPAQEETFAPGPLPTPFKTNGQ 

EDHATPGSAPNGGTKIPGNWQIKIRPTAAIATGSSRNKPLANSLPCPGGCSCDKIPGSG 

LKMNCNNRKV3S LJ^LKPiaSNVOELFLRDNKIHS IRKSHFVDYra^L 1 

VENI^TFKNLLDLRWLY^©SKYIJDTLSREKFAGLQNLEYLNVEm\IQLILPGTFNAMPK 

LR : L : LNl^ruhRSL F VDV? AG VS L£ KLS li^NNY FMYLPVAGVLDQLTS 1 1 Q I DI^HGNP W 

SCSCTIVPFXQWAERLGSEVLMSDLKCETFVNFFRKDFMLlLSNDEICPQLYARISPTLT 

SHS:^^S?GIJ^ETGTHSKSYLDTSRVS I SVLVPGLLLVFVTSAFTWGMLVF I LRI^ 

KRRDAI^ S S ASE IN SLQr^CD S S YWHNG? YNADGAHRVYDCGSHSLSD 
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FIGURE 3> 

AGrCGACTGCGTCCCCTGTACCCGSCGCCAGCTGrG'TTCCTG^^.CCCCAG^iATAACTCAGG 

GCTGCAGCGGGCCTGGCAGCGCTCCGCACACATTTCCTGTCGCGGCCTA^'GGGAAACTGT 

TGGCCGCTGGGCCCGCGGGGGGATTCTTGGCAGTTGGGGGGTCCGTCGGGAGCGAGGGCG 

GAGGGGAAGGGAGGGGGAACCGGGTTGGGGAAGCCAGCTGTAGA-GGGCGGTGACCGCGCT 

CCAGACACAGCTCTGCGTCCTCGAGCGGGACAGATCCAAGTTGC-GAGCAGCTCTGCGTGC 
GGGGCCTCAGAGA 

XMET ltrans=l-s, dir=f/ res=l} 

ATGA.GGCCGGCGTTCGCCCTGTGCCTCCTCTGGCA-GGCGCTCTGGCCCGGGCCGGGCGGC 
GGCGAACACCCCACTGCCGAXCGTGCTGGCTGCTCGGCCTCGGGGGCCTGCTACAGCCTG 
CAGCACGCTACCATGAAGCGGCAGGCGGCCGAGGAGGCCTGCATCCTGCGAGGTGGGGCG 
CTCAGCACCGTGCGTGCGGGCGCCGAGCTGCGCGCTGTGCTCGCGCTCCTGCGGGCAGGC 
CCA.GGGCCCGGAGGGGGCTCCAAAGACCTGCTGTTCTGGGTCGCACTGGAGCGCAGGCGT 
TCCCACTGCA-CCCTGGAGAiVCGAGCCTTTGCGGGGTTTCTCCTGGCTGTCCTCCGACCCC 
GGCGGTCTCGAAAGCGA.CACGCTGCAGTGGGTGGAGGAGCCCCAACGCTCCTGCACCGCG 
CG'GAGATGCGCGGTACTCCAGGCCACCGGTGGGGTCGAGCCCGCAGGCTGGAJVGGAGATG 
CGATGCCACCTGCGCGCCAACGGCTACCTGTGCAAGTACCAGTTTGAGGTCTTGTGTCCT 
GCGCCGCGCCCCGGGGCCGCCTCTAACTTGAGCTATCGCGCGCCCTTCCAGCTGCACIAGC 
GCCGCTCTGGACTTCAGTCCACCTGGGACCGAGGTGAGTGCGCTCTGCCGGGGACAGCTC 
CGGATCTCAGTTACTTGCATCGCGGACGAAATCGGCGCTCGCTGGGACAAACTCTCGGGC 
GATGTGTTGTGTCCCTGCCCCGGGAGGTACCTCCGTGCTGGCAAATGCGCAGAGCTCCCT 
AACTGCCTAGAXGACTTGGGAGGCTTTGCCTGCGAATGTGCTACGGGCTTCGAGCTGGGG 
AAGGACGGCCGCTCTTGTGTGACCAGTGGGGAAGGACAGCCGACCCTTGGGGGGACCGGG 
GTGCCCACCAGGCGCCCGCCGGCCACTGCAACCAGCCCCGTGCCGCAGAGAACATGGCCA 
ATCAGGGTCGACGAGA-AGCTGGGAGAGACACCACTTGTCCCTGAACAAGACAATTCAGTA 
ACATCTATTCCTGAGATTCCTCGATGGGGATCACAGAGCACGATGTCTACCCTTCAAATG 
TCCCTTCAAGCCGAGTCAAJIGGCCACTATCACCCCATCAGGGAGCGTGATTTCCAAGTTT 
AATTCTACGACTTCCTCTGCCACTCCTCAGGCTTTCGACTCCTCCTCTGCCGTGGTCTTC 
ATATTTGTGAGCACAGCAGTAGTAGTGTTGGTGATCTTGACCATGACAGTACTGGGGCTT 
GTCAJVGCTCTGCTTTCACGAAAGCCCCTCTTCCCAGCCAAGGAAGGAGTCTATGGGCCCG 

ccgggcctggagagtgatcctgagcccgctgctttgggctccagttctgcacattgcaca 
a;^.c-?-atggggtgajvagtcggggactgtgatctgcgggacagagcagagggtgccttgctg 

GCGGAGTCCCCTCTTGGCTCTAGTGATGCATAGGGAAACAGGGGAXA-TGGGCACTCCTGT 
G/^J-CAGTTTTTCACTTTTGATGAAACGGGGAACCAAGAGGAACTTACTTGTGTAACTGAC 
AATTTCTGCAGAJiATCCCCCTTCCTCTAAATTCCCTTTACTCCACTGAGGAGCTAAATCA 
G.^_^.CTGCACACTCC?TCCCTGATGATAGAGGAAGTGGAAGTGCCTTTAGGATGGTGATAC 
TGGGGGACCGGGTAGTGCTGGGGAGAGATATTTTCTTATGTTTATTCGGAGAATTTGGAG 

aagtgattgaacttttcaagaca.ttggaaacaaatagaaca.caatataatttacattaaa 

AJ^J^.T;iJirTTCTACCAAAATGGAJUi.GGAAATGTTCTATGTTGTTCAGGCTAGGAGTATATT 
GGTTCGAJ^JITCCCAGGGAAJW^JITAJUJiAATAAAAAATTAAAGGATTGTTGAT 
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FIGURE 3(? 

M?. ?A FAL C -LWQALW ? G P GGGEHP 7ADPAGC S AS GAC YS liKrlATMKRQAAEElAC I LRGGA 
lSrv"A.GA^lFJlVIJU.LRAG?G?GGGSKDLLrA^/AX£R.^^ 

GGLISCrLQW^/ZIFQRSCTARRCAVLQATGGVZPAGVTKHMRCKLRANGYLCKYQFEVLCP 
APRPGAASNLSYPAPFQLHSAALDFSPPGTEVSALCRGQLPISVTCIADEIGARWDKLSG 
DVLCFCPGRYLRA.GKCAELPNCLDDLGGFAXECATGFELGKDGRSCVTSGEGQPTLGGTG 
V? TR?.? PATA.TSPVPQRTWF I RVDEKLGET?LVPEQDNS'/T S I PEI PRWGSQSTMSTLQM 
S L QA^ S KAT I T P 3 G S VI S K FNS T T S S AT PQAFDS S S AWFI FVS TAWVLVILTMTVLGL 
VKLCFHSSPSSQPRKESMGPPGLESDPEPAJiJiGSSSAJiCTNNG'/KVGDCDLRDRAEGALL 
A^3?LG53DA 
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F IG URE 3 7 

CGGACGCGTGGGATTCAGCAGrGGCCTGTGGCTGCCAGi^.GCAGCTCCTCAGGGGA;^JlCTA 
AGCGTCGAGTCAGACGGCAXCATAA-.TCGCCTTTAAAAGTGCCTCCGCCCTGCCGGCCGCG 
TATCCCCCGGCTACCTGGGCCGCCCCGCGGCGGTGCGCGCGTGAGAGGGAGCGCGCGGGC 
AGCCGAGCGCCGGTGTGAGCCAGCGCTGCTGCCAGTGTGAGCGGCGGTGTGAGCGCGGTG 
GGTGCGGAGGGGCGTGTGTGCCGGCGCGCGCGCCGTGGGGTGCAAACCCCGAGCGTCTAC 
GCTGCC 

xMET {tr£ns=l-s, dir=f, res=l} 

ATGAGGGGCGCGAACGCCTGGGCGCCACTCTGCCTGCTGCTGGCTGCCGCCA.CCCAGCTC 
TCGCGGCAGCAGTCCCCAGAGA.GACCTGTTTTCACATGTGGTGGCAT7CTTACTGGAGAG 
TCTGGATT?ATTGGCAGTGAAGGTTTTCCTGGAGTGTACCCTCCAAATAGCAAATGTACT 
T GG AAAAT C AC A.GT TCCCGAAG G AAAJIG T AGT C G T T C T CAAT T T C C GAT T C ATA.GAC C T C 
GAGAGTGACAACCTGTGCCGCTATGA.CTTTGTGGATGTGTACAATGGCCATGCCAATGGC 
CAGCGCATTGGCCGCTTCrGTGGCACTTTCCGGCCTGGAGCCCTTGTGTCCAGTGGCAAC 
Ai^.GATGATGGTGCAGATGATTTCTGA.TGCCAACACAGCTGGCAATGGCTTCATGGCCA.TG 
TTCTCCGCTGCTGAACCAAACGA-AAGAGGGGATCAGTATTGTGGAGGACTCCTTGACAGA 
CCTTCCGGCTCTTTTAAAACCCCCAACTGGCCAGACCGGGATTACCCTGCAGGAGTCACT 
TGTGTGTGGCACATTGTAGCCCCAAAGAATCAGCTTATAGAATTAAAGTTTGAGAAGTTT 
GATGTGGAGCGA.Gi^.TAACTACTGCCGATATGATTATGTGGCTGTGTTTAATGGCGGGGAA 
GTCAACGATGCTAGAAGAATTGGAAAGTA.TTGTGGTGATAGTCCACCTGCGCCAATTGTG 
TTTGAGA.GAAATGAACTTCTTATTCAGTTTTTATCAGACTTAAGTTTAACTGCAGATGGG 
TTTATTGGTCACTACATATTCAGGCCAAJiAAAACTGCCTACAACTACAGAACAGCCTGTC 
ACCACCACATTCCCTGTAACCACGGGTTTAAAACCCACCGTGGCCTTGTGTCAACAAAAG 
TGTAGACGGACGGGGACTCTGGAGGGCAATTATTGTTCAAGTGACTTTGTATTAGCCGGC 
ACTGTTATCACAACCATCACTCGCGA.TGGGA-GTTTGCACGCCACAGTCTCGATCATCAAC 
ATCTACAAAaAGGG.AAAT?TGGCGATTCAGCAGGCGGGCAAGAAC:ATGAGTGCCAGGCTG 
ACTGTCGTCTGCAAGCAGTGCCCTCTCCTCAGAAGAGGTCTAAATTACATTATTATGGGC 
CAAGTAGGTGAAGATGGGCGAGGCAAAA-TCATGCCAAACAGCTTTATCATGATGTTCAAG 
ACCAAGA.ATCAGAAGCTCCTGGATGCCTTAJIAAAATAAGCAATGTTAACAGTGAACTGTG 
7CCATTTAAGCTGTATTCTGCCATTGCCTTTGAAAGATCTATGTTCTCTCAGTAGAAAAA 
AAJ^J-.TACTTATAJiAATTACATATTCTGAJVA.GAGGATTCCGAAAGATGGGACTGGTTGACT 
CTTCACATGATGGAGGTATGAGGCCTCCGA.GATAGCTGAGGGAAGTTCTTTGCCTGCTGT 
CAGAG'GAGCAGCTATCTGATTGGAJVACCTGCC-GACTTAGTGCGGTGATAGGAAGCTAAAJl 
GTGTCA-AGGGrTGACAGCTTGGAAGCGTTTATTTATACATCTCTGTAAAAGGATATTTTA 
G.--AT T GAG T T G T G r GAAGAT G 7 C AAAAAAAGAT T T TAGAAG T GCAATAT T TAT AG T GT T A 
TTTGTTTCACCTTCAAGCCTTTGCCCTGAGGTGTTACAATCTTGTCTTGCGTTTTCTAAA 
TC.AAT G C TTAATAJ^AATAT T T TTAAAGGAAJ^AAAAAAAAA 
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FTGirRF. 38 

MR SANAWAr Z. C LLlAAJvT Ql S RQQS ?ERF VrTCGG ILTGESGriGSEG FF G V Y P?NS KC T 
WKI TV?EGK\a"/LNFRFI DIE S DNLGRYDrv^DVY^^GrlANGQRI GRECGTERFGALVS S GN 
KT-IM^/CM I S EAI^IIA-GNG n-L^MFS AAE PNERGDQYC GGLLDRP S G S FKTFNW? DRD YPAGVT 
C^/;VHIVA.PKNQlIELKFEKrDVERDNYCRYDr^/AVFXGGEVNDARRIGKYCGDSP?APIV 
SERNElLrOFLSDlSLTADGFIGHYIFRPKKLPTTTEQPWTTFPVTTGLKPTVALCQQK 
C RR T G T L E GNY C S S D FVLAG WITTITRDGS LHATVS 1 1 NI YKEGNLAI QQAGKNH3 ARL 
T\^/GKQCFLLRRGLNYIIMGQVGEDGRGKIMPNSFIMMFKTJCNQKLLDAXKNKQC 
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FIGURE 39 

cgga:gcgtggg:3gacgcgtgggcggcccacggcgcccgcgggctggggcggtcgcttctt 

CCTTCTCCGTGGCCTACGAGGGTCCCCAGCCTGGGTAA^.GATGGCCCCATGGCCCCCGAAGG 
GCCTAGTCCCAGCTGTGC7CTGGGGCC7CAGCCTCTTCCTCAP.CCTCCCA-GC-ACCTATCTGG 
CrCCAGCCCTCTCCACCTCCCCAGTCTTCTCCCCCGCCTCAGCCCCATCCGTGTCATACCTG 

ccggggactggttgacagctttaacaagggcctggagag;ijiccatccgggacaj^ctttggag 

GTGGA^-JICACTGCCTGGGAGGAAGAGAATTTGTCCAAATACAAAGACAGTGAGACCCGCCTG 

gtagaggtgctggagggtgtgtgcagcaagtcagacttcgagtgccaccgcctgctggagct 
gagtgaggagctggtggagaggtggtggtttcacaagcagcaggaggccccggacctcttcc 
agtggctgtgctcagattccctgaagctctgctgccccgcaggcaccttcgggccctcctgc 

CTTCCCTGTCCTGGGGGAACAGAGAGGCCCTGCGGTGGCTACC-GGCAGTGTGAAGGAGAAGG 

gacacgagggggcagcgggcactgtgactgccaagccggctacgggggtgaggcctgtggcc 
agtgtggccttggctactttgaggcagaacgcaacgccagccatctggtatgttcggcttgt 

TTTGGCCCCTGTGCCCGATGCTCAGGACCTGAGGAATCAAACTGTTTGCAATGCAAGA-AGGG 
CTGGGCCCTGCATCACCTCAAGTGTGTAGACATTGATGAGTGTGGCACAGAGGGAGCCAACT 

gtggagctgaccaattctgcgtgaacactgagggctcctatgagtgccgagactgtgccaag 
gcctgcctaggctgcatgggggcagggccaggtcgctgtaagaagtgtagccctggctatca 
gcaggtgggctccaagtgtctcgatgtggatgagtgtgagacagaggtgtgtccgggagaga 

AC^J^X-CAGTGTGAAAACACCGAGGGCGGTTATCGCTGCATCTGTGCCGAGGGCTACAAGCAG 

atgg^jiggcatctgtgtgaaggagcagatcccagagtczvgcaggcttcttctcagagatgac 
agjiji.gacgagttggtggtgctgcagcagatgttctttggcatcatcatctgtgcactggcca 
cgctggctgct;ij\gggcga.cttggtgttcaccgccatcttcattggggctgtggcggccatg 

ACTGGCTACTGGTTGTCAGAGCGCAGTGACCGTGTGCTGGAGGGCTTCATCAAGGGCAGATA 

atcgcgggcaccacctgtaggacctcctcccacccaggctgcccccagagcttgggctgccc 
tgctgctggacactcaggacagcttggttta.tttttgagagtggggtaagcacccctacctg 
ccttacagagcagcccaggtacccaggcccgggcagacaaggcccctggggtaaaaagtagc 

CC-G^^J^vGGTGGATACCATGAGCTCTTCACCTGGCGGGGACTGGCAGGCTTCACAATGTGTGA 

atttcaaaagtttttccttaatggtggctgctagagctttggcccctgcttaggattaggtg 
gtcctcacaggggtggggccatcacagctccctcctgccagctgcatgctgccagttcctgt 
tctgtgttcaccacatccccacaccccattgccacttatttattcatctcaggaaataaaga 

A,AGG T C T T GGAAAGT i AAAJIAAAJUIAAAAAAAAAAAAAAA 
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FTGTJRE 40 
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FICxURE 41 

TGAGACCCTCCTGCAGCCTTCTCAAGGGA2AGCCCCACrCTGCC7CTTGC7CCTCCAG3G 
CAGCACC 

><;*''ZT (trans = l-s, dir=:f , res = l} 

ATGCAGCCCCTGTGGCTCTGCTGGGCACTCTGGGTGTTGCCCCTGGCCAGCCCCGGGGCC 
GCCCTC-ACCGGGGAGCAGCTCCTGGGCAGCCTGCTGCGGCAGCTGCAGCTCAAAGAGGTG 
CCCACCCTGGACAGGGCCGACATGGAGGAGCTGGTCATCCCCACCCACGTGAGGGCCCAG 
TACGTGGCCCTGCTGCAGCGCAGCCACGGGGACCGCTCCCGCGGAAAGAGGTTCAGCCAG 
AGCTTCCGAGAGGTGGCCGGCAGGTTCCTGGCGTTGGAGGCCAGCACACACCTGCTGGTG 
TTCGGCATGGAGCAGCGGCTGCCGCCCAACAGCGAGCTGGTGCAGGCCGTGCTGCC-GCTC 
TTCCAGGAGCCGGTCCCCAAGGCCGCGCTGCACAGGCACGGGCGGCTGTCCCCGCGCAGC 
GCCCGGGCCCGGGTGACCGTCGAGTGGCTGCGCGTCCGCGACGACGGCTCCAACCGCACC 
TCCCTCATCGACTCCAGGCTGGTGTCCGTCCACGAGAGCGGCTGGAAGGCCTTCGACGTG 
ACCGAGGCCGTGAACTTCTGGCAGCAGCTGAGCCGGCCCCGGCAGCCGCTGCTGCTACAG 
GTGTCGGTGCAGAGGGAGCATCTGGGCCCGCTGGCGTCCGGCGCCCACAAGCTGGTCCGC 
TTTGCCTCGCAGGGGGCGCCAGCCGGGCTTGGGGAGCCCCAGCTGGAGCTGCACACCCTG 
GACCTTGGGGACTATGGAGCTCAGGGCGACTGTGACCCTGAAGCACCAATGACCGAGGGC 
ACCCGCTGCTGCCGCCAGGAGATGTACATTGACCTGCAGGGGATC-AAGTGGGCCGAGAAC 
TGGGTGCTGGAGCCCCCGGGCTTCCTGGCTTATGAGTGTGTGGGCACCTGCCGGCAGCCC 
CCGGAGGCCCTGGCCTTCAAGTGGCCGTTTCTGGGGCCTCGACAGTGCATCGCCTCGGAG 
ACTGACTCGCTGCCCATGATCGTCAGCATCAAGGAGGGAGGCAGGACCAGGCCCCAGGTG 
GTCAGCCTGCCCAACATGAGGGTGCAGAAGTGCAGCTGTGCCTCGGATGGTGCGCTCGTG 
CCAAGGAGGCTCCAGC'CATAGGCGCCTAGTGTAGCCATCGAGC-GACTTGACTTGTGTGTG 
TTTCTGAAGTGTTCGAGGGTACCAGGAGAGCTGGCGATGACTGAACTGCTGATGGACAAA 
TCCTCTGTGCTCTCTAGTGAGCCCTGAATTTGCTTCCTCTGACAAGTTACCTCACCTAAT 
TTTTGCTTCTCAGGAATGAGAATCTTTGGCCACTGGAGAGCCCTTGCTCAGTTTTCTCTA 
TTCTTATTATTCACTGCACTATATTCTAAGCACTTACATGTGGAGATACTGTAACCTGAG 

ggcagaaagccca^:tgtgtcattgtttacttgtcctgtcactggatctgggctaaagtcc 

TCCACCACCACTCTGGACCTAAGACCTGGGGTTAAGTGTGGGTTGTGCATCCCCAATCCA 
GATAJVTAAAGACTTTGTAAAACATGAATAAAACACATTTTATTCTAAAA 



42 / 128 



wo 99/14328 



?CTA;S98n9230 



FIGURE 42 

vQFLVJLCWALVr/LFI^SFGAALTGECLLGSLLRQLOLKEVrriDRADMEEL^^^ 

YVALLQHSKGDRS R 3KRFS QS FRSVAGRRIJiiEAS THILVFGMEQRLF PNSE1VQA\T:.RL 

F0£?VFKAALHRHG?J.SPRSARARVrvZWLRVTU3DGSNRT£LIDSRLVSW^ 

TEAVl^TWQQLSRFRQPLLLQVSVQREHLGPLASGAHKLVRFASCGAPAGLGEPQLELHTL 

DLGDYGAQGDCDFEAPHTEGTRCCRQEKYIDLQGMKWAENWVLEPFGFLAYECVGTCRQP 

PEAI^^iFKWPFLGFRQCIASErDSLPMIVSIKEGGRTRPQVVSLPNMRVQKCSCASDGALV 

FRRLQ? 
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GTCTGTTCCCAGGAGTCCTTC3GCGGCTGr?GTGTCAGTGGCCTC-ATCGCGATGGGGACAAA 
GGCGCAAGTCGAGAGGAAACT3TTGTGCCTCTTCATATTGGCC-ATCCTGTTGTGCTCCCTGG 
C AT T GG G C AG T G T TAG AG T G CAC TC T TCTGJ^AX CT G;^A-GT CAGAATT CC TGAGAAT AAT CCT 
GTG^-AGTTGTCCTGTGCCTACTCGGGCTTTTCTTCTCCCCGTGTGGAGTGGAAGTTTGACCA 
AG GAGACAC C ACC AGAC T C G T TTGC TATAATAACAAGATCACAGCrTCCTATGAGGACCGGG 
TGACCTTCTTGCCAACTGGTATCJirCTTCAAGTCCGTGACACGGGAAGACACTGGGACATAC 
ACTTGTATGGTCTCTGAGGAAGGCGGCAACAGCTATGGGGAGGTCAAGGTCAAGCTCATCGT 

gcttgtgcctccatccaagcctacagtta;^.catcccctcctctgccaccattgggaaccggg 
cagtgctgacatgctcagaacaagatggttccccaccttctgaatacacctggttcaaagat 

GGGATAGTGATGCCTACGAATCCCAAAAGCACCCGTGCCTTCAGGAACTCTTCCTATGTCCT 
GAATCCCAC?JICAGGAGAGCTGGTCTTTGA7CCCCTGTCAGCCTCTGATACTGGAGAATACA 
GCTGTGAGGCACGGAATGGGTATGGGACACCCATGACTTCAAATGCTGTGCGCATGGAAGCT 
GTGGAGCGGAATGTGGGGGTCATCGTGGCAGCCGTCCTTGTAACCCTGATTCTCCTGGGAAT 
CTTGGTTTTTGGCATCTGGTTTGCCTATAGCCGAGGCCACTTTGACAGAACAAAGAAAGGGA 
C T T CGAG T AAG AAGG T GAT T T ACAG CCAGCCT AGTGCCCGAJi.GT GAAGGAGAATTCAAACAG 
ACCTCGTCATTCCTGGTGTGAGCCTGGTCGGCTCACCGCCTATCATCTGCATTTGCCTTACT 
CAGGTGCTACCGGACTCTGGGCCCTGATGTCTGTAGTTTCACAGGATGCCTTATTTGTCTTC 
TACACGCCACAGGGCCCCCTACTTCTTCGGATGTGTTTTTAATAATGTCAGCTATGTGCCCC 
ATCCTCCTTCATGCCCTCCCTCCCTTTCCTACCACTGCTGAGTGGCCTGGAACTTGTTTAAA 
GTGTTTATTCCGCA.TTTCTTTGAGGGATCAGGAAGGAATCCTGGGTATGCCATTGACTTCCC 
TTCTAAGTAGACAGCAAAAATGGCGGGGGTCGCAGGAATCTGCACTCAACTGCCCACCTGGC 
TGGCAGGGATCTTTGAATAGGTATCTTGAGCTTGGTTCTGGGCTCTTTCCTTGTGTACTGAC 
GACCAGGGCCAGCTGTTCTAGAGCGGGAATTAGAGGCTAGAGCGGCTGAAATGGTTGTTTGG 
TGATGACACTGGGGTCCTTCCATCTCTGGGGCCCACTCTCTTCTGTCTTCCCATGGGAAGTG 
CCACTGGGATCCCTCTGCCCTGTCCTCCTGAATACAAGCTGACTGACATTGACTGTGTCTGT 
GGAJIAATGGGAGCTCTTGTTGTGGAGAGGATAGTAAATTTTCAGAGAACTTGAAGCCAAAAG 
GATTTJ^AJ^.CCGCTGCTCTA^AGAJiAAGAAAACTGGAGGCTGGGCGCAGTGGCTCACGCCTG 
T.^.2.TCCCAGAGGCTGAGGCAGGCGGATCACCTGAGGTCGGGAGrTCGGGATCAGCCTGACCA 
ACATGGAGAAACCCTACTGGAAATACAAAGXTAGCCAGGCATGGTGGTGCATGCCTGTAGTC 
CCAGC T GC TCAGGAGCCTGGCAACAAGAGCAAJ^A.C TCCAGCTCA 
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F IG UKE 44 
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FIGTJRE 45 

CAGCGCGTGGCCGGCGCCGCTGTGGGGACAGCATGAGCGGCGGTTGGATGGCGCAGGTTGGA 

GCGTGGCGAACAGGGGCTCTGGGCCTGGCGCTGCTGCTGCTGCTCGGCGTCGGACTAGGCCT 

GGAGGCCGCCGCGAGCCCGCTTTCCACCCCGACCTCTGCCCAGGCCGCAGGCCCCAGCTCAG 

GCTCGTGCCCACCCACCAJ^.GTTCCAGTGCCGCACCAGTGGCTTATGCGTGCCCCTCACCTC-G 

CGCTGCGACAGGGACTTGGACTGCAGCGATGGCAGCGATGAGGAGGAGTGCAGGATTGAGCC 

ATGTACCCAGAAAGGGCAATGCCCACCGCCCCCTGGCCTCCCCTGCCCCTGCACCGGCGTCA 

GTGACTGCTCiGGGGGAACTGACAAGAAACTGCGCAACTGCAGCCGCCTGGCCTGCCTAGCA 

GGCGAGCTCCGTTGCACGCTGAGCGATGACTGCATTCCACTCACGTGGCGCTGCGACGGCCA 

CCCAGACTGTCCCGACTCCAGCGACGAGCTCGGCTGTGGAACCAATGAGATCCTCCCGGAAG 

GGGATGCCACAACCATGGGGCCCCCTGTGACCCTGGAGAGTGTCACCTCTCTCAGGAATGCC 

ACAA.CCATGGGGCCCCCTGTGACCCTGGAGAGTGTCCCCTGTGTCGGGAATGCCACATCCTC 

CTC7GCCGGAGACCAGTCTGGAAGCCCAACTGCCTATGGGGTTATTGCAGCTGCTGCGGTGC 

TCAGTGCAAGCCTGGTCACCGCCACCCTCCTCCTTTTGTCCTGGCTCCGAGCCCAGGAGCGC 

CTCCGCCCACTGGGGTTACTGGTGGCCATGAAGGAGTCCCTGCTGCTGTCAGAACAGAAGAC 

CTCGCTGCCCTGAGGACAAGCACTTGCCACCACCGTCACTCAGCCCTGGGCGTAGCCGGACA 

GGAGGAGAGCAGTGATGCGGATGGGTACCCGGGCACACCAGCCCTCAGAGACCTGAGTTCTT 

CTGGCCACGTGGAACCTCGAACCCGAGCTCCTGCAGAAGTGGCCCTGGAGATTGAGGGTCCC 

TGGACACTCCCTATGGAGATCCGGGGAGCTAGGATGGGGAACCTGCCACAGCCAGAACTGAG 

GGGCTGGCCCCAGGCAGCTCCCAGGGGGTAGAACGGCCCTGTGCTTAAGACACTCCCTGCTG 
CCCCGTCTGAGGGTGGCGATTAAAGTTGCTTC 
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FIGURE 46 

><£S .DNr-3 322: 

xsubur.iz 1 cf 1, 232 aa, 1 stop 
xHW: 28991, pi: 4.62, NX (S/T) : 5 

MSGGWNlAQVGAWKTGALGi^XLLLLC-LGLGLEAAASFLSTPTSAQAAG?SSGSC??TKFC 
CRT£GLCVP:lTWkCDRDLDCSDGSDEEECRIEPCTQKGQCPPPPGL.PC?CTGVSDCSGGT 
DKKLRNCSRI^CLAGHIJ^CTLSDDCIPLTWRCDGHPDCPDSSDELGCGTKEILPEGDATT 
KG? PVTLES VTSLRKATTMG? PVTLES VP SVGNATS S SAGDQSGS PTAYGVI AAAAVLSA 
EZ.VTATLL:jL£VrLRAQERLRPLGIiLVAMKESLLLSEQKTSLP 
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FTGimE 47 

CCCACGCGTCCGGTCTCGCTCGCTCGCGCAGCGGCGGCAGCAGAGGTCGCGCACAGATGCGG 
GTTAGACTGGCGGGGGGAGGAGGCGGAGGAGGGAAGGAAGCTGCATGCATGAGACCCACAGA 
CTCTTGCAAGCTGGATGCCCTCTG7GGATGAAAGATGTATCATGGAATGAACCCGAGCAATG 
GAGATGGATTTCTAGAGCAGCAGCAGCAGCAGCAGCAACCTCAGTCCCCCCAGAGACTCTTG 
GCCGTGATCCTGTGGTTTCAGCTGGCGCTGTGCTTCGGCCCTGCACAGCTCACGGGCGGGTT 
CGA.TGACCTTCAAGTGTGTGCTGACCCC3GCATTCCCGAGAATGGCTTCAGGACCCCCAGCG 
GAGGGGTTTTCTTTGAAGGCTCTGTAGCCCGATTTCACTGCCAAGACGGATTCAAGCTGAAG 
G G CG CT ACAAAG AGACTG TGTTTG AAGCATTTTAATGGAACCCTAGGCTGG ATCCC AAGTGA 
TAATTCCATCTGTGTGCAAGAAGATTGCCGTATCCCTCAAATCGAAGATGCTGAGATTCATA 
ACAAGACATATAGACATGGAGAGAAGCTAATCATCACTTGTCATaAAGGATTCAAGATCCGG 
TACCCCGACCTACACAATATGGTTTCATTATGTCGCGATGATGGAACGTGGAATAATCTGCC 
CATCTGTCAAGGCTGCCTGAGACCTCTAGCCTC7TCTAATGGCTATGTAAACATCTCTGAGC 
TCCAGACCTCCTTCCCGGTGGGGACTGTGATCTCCTATCGCTGCTTTCCCGGATTTAAACTT 
GATGGGTCTGCGTATCTTGAGTGCTTACAAAACCTTATCTGGTCGTCCAGCCCACCCCGGTG 
CCTTGCTCTGGAJi.GCCCAAGTCTGTCCACTACCTCCAATGGTGAGTCACGGAGATTTCGTCT 
GCCACCCGCGGCCTTGTGAGCGCTACAACCACGGAACTGTGGTGGAGTTTTACTGCGATCCT 
GGCTACAGCCTCACCAGCGACTACAAGTACATCACCTGCCAGTATGGAGAGTGGTTTCCTTC 
TTATCAAGTCTACTGCATCAAATCAGAGCAAACGTGGCCCAGCACCCATGAGACCCTCCTGA 
CCACGTGGAAGA.TTGTGGCGTTCACGGCAACCAGTGTGCTGCTGGTGCTGCTGCTCGTCATC 
CTGGCCAGGATGTTCCAGACCAAGTTCAAGGCCCACTTTCCCCCCAGGGGGCCTCCCCGGAG 
TTCCAGCAGTGACCCTGACTTTGTGGTGGTAGACGGCGTGCCCGTCATGCTCCCGTCCTATG 
ACGAAGCTGTGAGTGGCGGGTTGAGTGCCTTAGGCCCCGGGTACATGGCCTCTGTGGGCCAG 
GGCTGCCCCTTACCCGTGGACGACCAGAGCCCCCCAGCATACCCCGGCTCAGGGGACACGGA 
GACAGGCCCAGGGGAGTCAGAAACCTGTGACAGCGTCTCAGGCTCTTCTGAGCTGCTCCAAA 
GTCTGTATTCACCTCCCAGGTGCCAAGAGAGCACCCACCCTGCTTCGGACAACCCTGACATA 
ATTGCCAGCACGGCAGAGGAGGTGGCATCCACCAGCCCAGGCATCCATCATGCCCACTGGGT 
GTTGTTCCTAAGAJLACTGATTGATTAAAAAATTTCCCAAAGTGTCCTGAAGTGTCTCTTCAA 
ATACATGTTGATCTGTGGAGTTGATTCCTTTCCTTCTCTTGGTTTTAGACAAATGTAAACAA 
AGCTCTGATCCTTAAAATTGCTATGCTGATAGAGTGG7GAGGGCTGGAAGCTTGATCAAGTC 
CTGTTTCTTCTTGACACAGACTGATTAAJ^TTAAAAGNAAAAAA 
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FIGTJT^E 48 

XSS.DNA331C7 

xsubunit 1 cf 1, 4 SO aa, I stop 
xKW: 53520, pi: 5.41, NX(S/T): 4 

MYHGMNFSNGDGFLEQQQQQQQPQSPQRLLAVILWFQLALCFGPAQLTGGFDDLQVCADP 
G I FENGFRTPSGGVFr EGS VARFHCQDGFKLKGATKRLCLKHFNGTLGWI PSDNS I CVQE 
DCPa FQ lEDAE IHK^TYT^GEKLI ITCHEGFKIRYPDLHNMVSLCRDDGTWNNL? I COGC 
LRFLJiSSNGYVNISZLQTSFPVGTVISYRCFPGFKIJDGSAYLECLQNLIWSSSPFRCLAL 
EASVCPLPFrWSHGDFVCHPRPCERyNHGTWEFyCDPGYSLTSDYKYITCQYGEWFPSY 
QVYCIKSEQTWPSTHETLLTTWKIVAFTATSVLLVLLLVILARMFQTKFKAHFPPRGPPR 
SSSSDFDFVWDGVPVMLPSYDEAVSGGLSALGPGYMASVGQGCPLPVDDQSPPAYPGSG 
DTDTGPGESETCDSVSGSSEIiLQSLYSPPRCQESTHPASDtJPDIIASTAEEVASTSPGIH 
HAKWVLFLRN 
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FICtTJRE 49 

CCCACGCGTCCGCTCCGCGCCCTCCCCCCCGCCTCCCGTGCGGTCCGTCGGTGGCCTAGAGJi. 

TGCTGCTGCCGCGGirGCAGTTGTCGCGCACGCCTCTGCCCGCCAGCCCGCTC'CACCGCCGT 

AGCGCCCGAGTGTCGGGGGGCGCACCCGAGTCGGGCCATG^i-GGCCGGGAACCGCGCTACAGG 

CCG7GCTGCTGGCCGTGCTGCTGGTGGGGCTGCGGGCCGCGA.CGGGTCGCCTGCTGAGTGCC 

TCGGATTTGGACCTCAGAGGAGGGCAGCCA.GTCTGCCGGGGAGGGACACAGAGGCCTTGTTA 

TAA;^.GTCATTTACTTCCATGATACTTCTCGAP.GACTGAJ^CTTTGAGGAAGCCAAAGA;i.GCCT 

GCAGGAGGGATGGAGGCCAGCTAGTCAGCATCGAGTCTGAAGATGAACAGAAACTGATAGAA 

>^J^.GTrCATTG^Ji.AACCTCTTGCCA?CTGATGGTG?.CTTCTGGATTGGGCTCAGGAGGCGTGA 

GGAGAJiJ^CAAJ^.GC^JVTAGCACAGCCTGCCAGGACCTTTATGCTTGGACTGATGGCAGCATAT 

CACAATTTAGGAACTGGTATGTGGATGAGCCGTCCTGCGGC^.GCGAGGTCTGCGrGGTCATG 

TACCATCAGCCATCGGCACCCGCTGGCATCGGAGGCCCCTACATGTTCCAGTGGAATGATGA 

CCGGTGCAACATGAAGAACAATTTCATTTGCAJ^ATATTCTGATGAGAAACCAGCAGTTCCTT 

CTAGAGAAGCTGAAGGTGAGGAJVACAGAGCTGACAACACCTGTACTTCCAGAAGAAACACAG 

GAAGAAGATGCCAAAAAAACATTTAAAGA;iAGTAGAGAAGCTGCCTTGAATCTGGCCTACAT 

CCTA.ATCCCCAGCATTCCCCTTCTCCTCCTCCTTGTGGTCACCACAGTTGTATGTTGGGTTT 

GGATCTGTAGAAAAAGAAAJ^CGGGAGCAGCCAGACCCTAGCACAAAGAAGCAACACACCA.TC 

TGGCCCTCTCCTCACCAGGGAAACAGCCCGGACCTAGAGGTCTACAATGTCATAAGAAAACA 

AJi.GCGAAGCTGACTTAGCTGAGACCCGGCCAGACCTGAAGAJVTATTTCATTCCGAGTGTGTT 

CGGGAGAAGCCACTGCCGATGACATGTCTTGTG^.CTATGACAACATGGCTGTGAACCCATCA 

GAAAGTGGGTTTGTGACTCTGGTGAGCGTGGAGAGTGGATTTGTGACCAATGACATTTATGA 

GTTCTCCCCAGACCAAATGGGGAGGAGTAJi.GGAGTCTGGATGGGTGGAAAATGAAATATATG 

GTTATTAGGACATATAJiJiJUVACTGAAJVCTGACAACAATGGAAAAGAAATGATAAGCAAAATC 

C TC T TATT T T C TATA^X-G^JIAATACACAGAAGGTCTATGAACAAGCTTAGATCAGGTCCTGT 

GGATGAGCATGTGGTCCCCACGACCTCCTGTTGGACCCCCACGTTTTGGCTGTATCCTTTAT 

CCCAGCCAGTCATCCAGCTCGACCTTATGAGAAGGTACCTTGCCCAGGTCTGGCACATAGTA 

GAGTCTC;^Ji.TAAATGTCACTTGGTTGGrTGTATCTAACTTTTAAGGGACAGAGCTTTACCTG 

gcagtgat;ij^jvgatgggctgtggagcttggaaaaccacctctgttttccttgctctatacag 
cagcacatattatcatacagacagaaaatccag;^atcttttcajuvgcccacatatggtagca 
caggttggcctgtgcatcggcaattctcatatctgtttttttcaaagaataaaatcaaataa 

AGAGCAGGAAAA;iJUiAA 
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FICxTJRE SO 
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FTGIJRF 51 

GGGGTCTCCCTCAGGGCCGGGAGGCACAGCGGTCCCTGCTTGCTGAAGGGCTGGATGTACGC 
ATCCGCAGGTTCCCGCGGACTTGGGGGCGCCCGCTGAGCCCCGGCGCCCGCE.GAAGACTTGT 
GTTTGCCTCCTGCAGCCTCAACCCGGAGGGCAGCGAGGGCCTACCACCATGATCACTGGTGT 
GTTCAGCATGCGCTTGTGGACCCCAGTGGGCGTCCTGACCTCGCTGGCGTACTGCCTGCACC 
AGCGGCGGGTGGCCCTGGCCGAGCTGCAGGAGGCCGATC-GCCAGTGTCCGGTCGACCGCAGC 
CTGCTGAAGTTGAAAATGGTGCAGGTCGTGTTTCGACACGGGGCTCGGAGTCCTCTCAAGCC 
GGTCCCGCTGGAGGAGCAGGTAGAGTGGAACCCCCAGCTATTAGAGGTCCCACCCCAAACTC 
AGTTTGATTACACAGTCACCAATCTAGCTGGTGGTCCGAAACCATATTCTCCTTACGACTCT 
CAATACCATGAGACCACCCTGAAGGGGGGCATGTTTGCTGGGCAGCTGACCAAGGTC-GGCAT 
GCAGCAAATGTTTGCCTTGGGAGAGAGACTGAGGAAGAACTATGTGGAAGACATTCCCTTTC 
TTTCACCAACCTTCAACCCACAGGAGGTCTTTATTCGTTCCACTAACATTTTTCGGAATCTG 
GAGTCCACCCGTTGTTTGCTGGCTGGGCTTTTCCAGTGTCAGAAAGAAGGACCCATCATCAT 
CCACACTGATGAAGCAGATTCAGAAGTCTTGTATCCCAACTACCAAAGCTGCTGGAGCCTGA 
GGCAGAGAACCAGAGGCCGGAGGCAGACTGCCTCTTTACAGCCAGGAATCTCAGAGGATTTG 
AAAAAGGTGAAGGACAGGATGGGCATTGACAGTAGTGATAAAGTGGACTTCTTCATCCTCCT 
GGACAACGTGGCTGCCGAGCAGGCACACAACCTCCCAAGCTGCCCCATGCTGAAGAGAnTG 
CACGGATGATCGAACAGAGAGCTGTGGACACATCCTTGTACATACTGCCCAAGGAAGACAGG 
GAAAGTCTTCAGATGGCAGTAGGCCCATTCCTCCACATCCTAGAGAGCAACCTGCTGAAAGC 
CATGGACTCTGCCACTGCCCCCGACAAGATCAGAAAGCTGTATCTCTATGCGGCTCATGATG 
TGACCTTCATACCGCTCTTAATGACCCTGGGGATTTTTGACCACAAATGGCCACCGTTTGCT 
GTTGACCTGACCATGGAACTTTACCAGCACCTGGAATCTAAGGAGTGGTTTGTGCAGCTCTA 
TTACCACGGGAAGGAGCAGGTGCCGAGAGGTTGCCCTGATGGGCTCTGCCCGCTGGACATGT 
TCTTGAATGCCATGTCAGTTTATACCTTAAGCCCAGAAAAATACCATGCACTCTGCTCTCAA 
ACTCAGGTGATGGAAGTTGGAAATGAAGAGTAACTGATTTATAAAAGCAGGATGTGTTGATT 
TTAAAATAAAGTGCCTTTATACAATG 



52 / 128 



wo 99/14328 



PCT/US98/19330 



FTGTTRE 52 

>< s S . 4 4 3 4 

xsucunxc 1 cf 1, 426 aa, 1 s.op 
xi-TW: 48836, pi: 6. 55, NX(S/T): 0 

K I TGVF SMRLWT P VCA^LT S LAY CLHQRRVAIJiJE LQE AD G QC? VT:RS LLKLK>:VQ W 
ARSPLKFLFLEECvT-WI^IPQLLEVPPQTQFDYTVTKLAGGFKPySFYDSOYHETTLKGGMF 
AG0LTKVGMQQM?ALGERLRKNYVSDIFFLS?7?NPQEVrIR£TNIFRKLE£TRCLLAGL 
FCCQKEGFIIIKTDEATSEVLYPNYQSCWSLRQRTRGRRQTASIjQPGISEDLKKVKDRMG 
I D S S D KVDF F I LLDNVAAEQ AiiKLP S CPKLKRi^" ARMI EQRAVDT SLY I LPKEDRESLQMA 
VG? F LH I LE £ NLL:<A:-n:SATAFDK I RKLYLYAAKDVTF I PLLKTLG IFDHKWPPFAVDLT 
MELYQKlHS?:EV:FVQLYYHGKEQV?RGC?DGLC?LDMFLNAM£'m'LS?EKYHALCSQTQ 
WEVGN'EE 
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FIGURE 13 

CrCCTCTTAACATACTTGCAGCTi:^J^J.JlCTA.^^TATTGC7GCTTC^^G^^^^ 

TAJ^JvTTTCAGGTCATCACCTTCACCTGCCTTGG7C;iTGGCTCTGCT;^TTCTCCTTGAT--TT 

^^^i;::^X^H^^-^-^-^^-^^-^T^^TAGCGTCTCCAT^^ 

ACGGCTGGGACATTAAGGACGTGGCTGTGTTGTGCCGGGAGCTGGGCTGTGGAGCTG-CAGC 
GG;^_t.ccCCTAGTGGTATTTTGTATGAGCCACCAGCAGAA;^J^GAGCAAAAGGTCCTci^CCA 
ATCAGTCAGTTGCACAGGAACAGAAGATACATTGGCTCAGTGTGAGCAAGAAGAAGTTTATG 
ATTGTTCACATGATGAAGATGCTGGGGCATCGTGTGAGAACCCAGAGAGCTCTTTCTCCCC^ 
GTCCCAGAGGGTGTCAGGCTGGCTGAGGGCCCTGGGCATTGCAAGGGACGCGTGGAAGTGAA 
GCACCAGA.^CCAGTGGTATACCGTGTGCCAGACAGGCTGGAGCCTCCGGGCCGC^GGTG^ 
TGTGCCGGCAGCTGGGATGTGGGAGGGCTGTACTGACTCAAAAACGCTGCAACAAGCATGCC 
TATGGCCG.AAAA.CCCATCTGGCTGAGCCAGATGTCATGCTCAGGACGAGAAGCAACCCTT^ 
GGATTGCCCTTCTGGGCCTTGGGGGAAGAACACCTGCAACCATGATGAAGACACGTGGGTCG 
AATGTG^J.GATCCCTTTGACTTGAGACTAGTAGGAGGAGACAACCTCTGCTCTGGGCGACTG 

CCA^-u-TGGTATGCAAGCAACTGGGCTGTGGGAAGTCCCTCTCTCCCTCCTTCAGAGACCGGA 

^?;^^;[S^^^^^^I^S^-tt^ccgcatctggctggataatgitcgttgctS^^ 

X GTGG^TGTCATC i GCTCAGTGTAGGTGGGCATCATCTAATCTGTTGAGTGCCTGAATAGAA 




ccacctttcctatgtctccacattgcacacagcagattcccagcctccSaattgtgtgt^ 
caactacttaaatacattctcacacacacacacacacacacacacacacaca^cacaSt^^ 

CACCATTrQTCCTGTTTCTCTGAAGAACTCTGACAAAATACAGATTTTGGTACTGASGAa^ 

ttctagaggaacggaattttaaggataaattttctgaattggttatggggtttctgaK^to 
°^^^-^,\^^^^,^^^^ttagatataaaattctggtaa™^ 



54 / 128 



wo 99/14328 



PC7r/US98/I9330 



FIGURE 54 

>cSE .DNA3 31C: 

xsubunic I of 1, 34 7 aa, 1 step 
xK^l: 3E13C, pi: 5.4C, NX{S/T): 0 

MALL r S L I " CTR PC- FI^.S PS G VRL VGGLHHCE GR VE VEQ KGQ WGTVODGTO 
LCRZ:.GCG.3iJi,SGT?S G : LYEP ? AHK3QKVL I OS VSCTGTEDTLAQCEQEEVYDCSKDEDA 
GA£CEN?ESS?SPVFEGVRI^J3GPGHCKGRVEVKHQNQWYTVCQ7GW5LRAAKWCRQLG 
CGRA'^T^TOK^CNKHAYGRK? I WLSQMS CSGREATLQDCPSGPWGKNTCNKDEDTWVECED 
PrDLRLVGGDNLCSGRLEVLKKGVWGSVCDDNT'JGEKEDQVVCKQLGCGKSLSPSFRDRKC 
YGPGVGRIWLDrr^/RCSGEEQSLEQCQKRFWGFHDCTHQEDVAVICSV 



55 / 128 



wo 99/14328 



PCT/US98/19330 



F I GURE 5 5 

AC7GCACTCGGTTCTATCGATTGAATTCCCCGC-GGATCCTCTAGAGATCCCTCGACCTCGAC 
CCACGCGTCCGCGGACGCGTGGGGGGACGCGTGGGCCGGCTACCAG3AAGAGTGTGGCGAAG 
GTGAAGGCCATGGACTTCATCACCTCCACAGCCATCCTGCCCCTGCTGTTCGGGTGCCTGGG 
CGTCTTCGGCCTCTTCCGGCTGCTGaAGTGGGTGCGCGGGAAGGCCTACCTGCGGAATGCTG 
TGG T GGTG ATCAC AGG CG C C ACCTCAGGG CTGGGCAAAGAATGTGCAAAAGTCTT CTATGCT 
GCGGG7GCTAAACTGGTGCTCTGTGGCCGGAATGGTGGGGCCCTAGAAGAGCTCATCAGAGA 
ACTTACCGCTTCTCATGCCACCAAGGTGCAGACACACAAGCCTTACTTGGTGACCTTCGACC 
TCACAGACTGTGGGGCCATAGTTGCAGCAGCAGCTGAGATCCTGCAGTGCTTTGGCTATGTC 
GACATACTTGTCAACAATGCTGGGATCAGCTACCGTGGTACCATCATGGACACCACAGTGGA 
TGTGGACAAGAGGGTCATGGAGACAAACTACTTTGGCCCAGTTGCTCTAACGAAAGCACTCC 
TGCCCTCCATGATCAAGAGGAGGCAAGGCCACATTGTCGCCATCAGCAGCATCCAGGGCAAG 
ATGAGCATTGCTTTTCGATCAGCATATGCAGCCTCCAAGCACGCAACCCAGGCTTTCTTTGA 
CTGTCTGCGTGCCGAGATGGAACAGTATGAAATTGAGGTGACCGTCATCAGCCCCGGCTACA 
TCCACACCAACCTCTCTGTAAATGCCATCACCGCGGATGGATCTAGGTATGGAGTTATGGAC 
ACCACCACAGCCCAGGGCCGAAGCCCTGTGGAGGTGGCCCAGGATGTTCTTGCTGCTGTGGG 
GAAGAAGAAGAAAGATGTGATCCTGGCTGACTTACTGCCTTCCTTGGCTGTTTATCTTCGAA 
CTCTGGCTCCTGGGCTCTTCTTCAGCCTCATGGCCTCCAGGC-CCAGAAAAGAGCGGAAATCC 
AAGAACTCCTAGTACTCTGACCAGCCAGGGCCAGGGCAGAGAAGCAGCACTCTTAGGCTTGC 
TTACTCTACAAGGGACAGTTGCATTTGTTGAGACTTTAATGGAGATTTGTCTCACAAGTGGG 
AAAGACTGAAGAAACACATCTCGTGCAGATCTGCTGGCAGAGGACAATCAAAAACGACAACA 
AGCTTCTTCCCAGGGTGAGGGGAJiJi.CACTTAAGGAATAAATATGGAGCTGGGGTTTAACACT 
AAAAAC7AGAAATAAACATCTCAAACAGTAAAAAAAAAAAAAAAGGGCGGCCGCGACTCTAG 
AG7CGACCTGCAGAAGCTTGGCCGCCATGGGCCAACTTGTTTATTGCAGCTTATAATGGTTAC 
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FIGURE 56 

><3S .DNA35600 

xsubunit 1 cr 1, 310 aa, l stop 
xMW: 33524, pi: 9.55, InIX(S/T): 1 

MDFITSTArLFLLFGCL:?VTGLFPXLQWVRGKAYLHKA\Arw^ITGATS3LGKECAKVFYAA 

GAKlVLCGPJ^IGGALEELIRELTASHATKA/Qr-rKPYLVTFDLTDSGAIVAAAAKILQCFGY 

VDIL^/NNAGISYHGTIN!DTTVDVDKR^/>lETNyFG?VALTKALL?SMIKRRQGHIVAISSI 

QGK>ISIPFRSAYAJISKHATQAFFDCLRAXMEQYEIE\/TVISPGYIHTNL£VNAITADGSR 

YGVMDTTTAQGRSPVEVAQDVIJ^VGKKKKDVILADLLPSI^VYLRTI^J^GL 

ARKERKSKNS 
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nGTJR E 57 

CCCACGC3TCCGCrGGTGTTAGArCGAGCAACCCTCTAAAA3CAGTTTAGAGTGGTAAAAA.J^ 
AAAAAAAAACACACCAJiACG CT CG GAG CCAC AAAAGGG 
xWET {trans=l-s, dir=f, res = l}> 

ATGAAATTTCTTCTGGACATCCTCCTGCTTCTCCCGTTACTGATCGTCTGCTCGCTAGAGTC 

CTTCGTGAAGCTTTTTATTCCTAAGAGGAGAAAATCAGTCACCGGCGAAATCGTGCTGATTA 

CAGGAGCTGGGCATGGAATTGGGAGACTGACTGCCTATGAATTTGCTAAACTTAAAAGCAAG 

CTGGTTCTCTGGGATATAAATAAGCATGGACTGGAGC-AAACAGCTGCCAAATGCAAGGGACT 

GGGTGCCAAGGTTCATACCT7TGTGGTAGACTGCAGCAACCGAGAAGATATTTACAGCTCTG 

CAAAGAAGG7GAAGGCAGAAATTGGAGATGTTAGTATTTTAGTAAATAATGCTGGTGTAGTC 

TATACATCAGATTTGTTTGCTACACAAGATCCTCAGATTGAAAAGACTTTTGAAGTTAATGT 

ACTTGCACATTTCTGGACTACAAAGGCATTTCTTCCTGCAATGACGAAGAATAACCATGGCC 

ATATTGTCACTGTGGCTTCGGCAGCTGGACATGTCTCGGTCCCCTTCTTACTGGCTTACTGT 

TCAJVGCAAGTTTGCTGCTGTTGGATTTCATAAJ^CTTTGACAGATGAACTGGCTGCCTTACA 

AA.rAACTGGAGTCAAJU^CAACATGTCTGTGTCCTAATTTCGTAAACACTGGCTTCATCAAAA 

ATCCAAGTACAAGTTTGGGACCCACTCTGGAACCTGAGGAAGTGGTAAACAGGCTGATGCAT 

GGGATTCTGACTGAGCAGAAGATGATTTTTATTCCATCTTCTATAGCTTTTTTAACAACATT 

GGAAAGGATCCTTCCT6AGCGTTTCCTGGCAGTTTTAAAACGAAAAATCAGTGTTAAGTTTG 

ATGCAGTTATTGGATATAAAATGAAAGCGCAATAAGCACCTAGTTTTCTGAAAACTGATTTA 

CCAGGTTTAGGTTGA7GTCATCTAATAGTGCCAGAATTTTAATGTTTGAACTTCTGTTT7TT 

CTAATTATCCCCATTTCTTCAATATCATTTTTGAGGCTTTGGCAGTCTTCATTTACTACCAC 

TTGTTCT7TAGCCAAAAGCTGATTACATATGATATAAACAGAGAAATACCTTTAGAGGTGAC 

TTTAAGGAAAATGAAGAAAAAGAACCAAAATGACTTTATTAAAATAATTTCCAAGATTATTT 

GTGGCTCACCTGAAGGCTTTGCAAAATTTGTACCATAACCGTTTATTTAACATATATTTTTA 

TTTTTGATTGCAC?TAAATTTTGTATAATTTGTGTTTCTTTTTCTGTTCTACATAAAATCAG 

AA.^.CTTCAAGCTCTGTAAATAAAATGAAGGACTATATCTAGTGGTATTTCACAATGAATATC 

ATGA_£.CTCTCAATGGGTAGGTTTCATCCTACCCATTGCCACTCTGTTTCCTGAGAGATACCT 

CACATTCCAATGCCAAACATTTCTGCACAGGGAAGCTAGAGGTGGATACACGTGTTGCAAGT 

ATAAAAGCATCAGTGGGATTTAAGGAGAATTGAGAGAATGTACCCACAAATGGCAGCAATAA 

TAAATGGATCACACTTAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
^JiAAA^JiJlAAAAJiJ^AAAAJVAAAA.z^^^ 
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HGU RE 58 

> < subunit 1 of 1 , 300 aa, 1 stop 

> <MW: 32964, pi: 9.52 

Xsignal peptide> 
MKFLLDILLLLPLLIVCSL 
><stan mature protein^ 

ESFVKLFlPKRRKSVTGErVT.nGAGHGIGRLTAYEFAKUCSKL\^ 

CKGLG.^KVmFV\T)CSNREDIYSSAKKVKAHGDVSILVNT^AGVVYTSD^^ 

TFE\Wl.AHFWTTKAFLPA^raCNNHGHI\TVASAAGHVSVPFLLA 

xpuTaiive oxidoreductase active site, by similarity to y00P_MYCTU and BUDC„KLETE> 

YCSSKFAAVGFHKTLTDELAALQITGVKTTCLCPNFVOTGHKNPSTSLGPTL^ 

RLiMHGE.TEQK\IIFIPSSIAH.TTLERILPEIU^AVLKRKISV^ 
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HGU RE 59 

CCCACGCGTCCGCGGACGCGTGGGTCGACTAGTTCTAGATCGCGAGCGGCCGCCCGCGGCTC 
AGGGAGGAGCACCGACTGCGCCGCACCCTGAGAGATGGTTGGTGCCATGTGGAAGGTGATTG 
TTTCGCTGGTCCTGTTGA.TGCCTGGCCGCTGTGATGGGCTGTTTCG-CTCCCTATACAGAAGT 
GTTTCCATGCCACCTAAGGGAGACTCAGGACAGCCATTATTTCTCACCCCTTACATTGAAGC 
TGGG AA.GAT C CAAAAAGG AAGAGAATTGAGTTTGGTCGGCC CTTTCCCAGGACTGAAC ATG A 
AGAGTTATGCCGGCTTCCTCACCGTGAJITAAGACTTACAACAGCAACCTCTTCTTCTGGTTC 
TTCCCAGCTCAGATACAGCCAGAAGATGCCCCAGTAGTTCTCTGGCTACAGGGTGGGCCGGG 
AGGTTCATCCATGTTTGGACTCTTTGTGGAACATGGGCCTTATGTTGTCACAAGTAACATGA 
CCTTGCGTGACAGAGACTTCCCCTGGACCACA-ACGCTCTCCATGCTTTACATTGACAATCCA 
GTGGGCACAGGCTTCAGTTTTACTGATGATACCCACGGATATGCAGTCAATGAGGACGATGT 
AGCACGGGATTTATACAGTGCACTAATTCAGTTTTTCCAGATATTTCCTGAATATAAAAATA 
ATGACTTTTATGTCACTGGGGAGTCTTATGCAGGGAAATATGTGCCAGCCATTGCACACCTC 
ATC C ATT CC CT CAACC CTGTGAGAGAGGTGAAGATCAACCTGAACGGAATTGCTATTGGAGA 
TGGATATTCTGATCCCGAATCAATTATAGGGGGCTATGCAGAATTCCTGTACCAAATTGGCT 
TGTTGGATGAGAAGCAJUIAAAAGTACTTCCAGAAGCAGTGCCATGAATGCATAGAACACATC 
AGGAAGCAGAACTGGTTTGAGGCCTTTGAAATACTGGATAAACTACTAGATGGCGACTTAAC 
AAGTGATCCTTCTTACTTCCAGAATGTTACAGGATGTAGTAATTACTATAACTTTTTGCGGT 
GCACGGAJICCTGAGGATCAGCTTTACTATGTGAAATTTTTGTCACTCCCAGAGGTGAGACAA 
GCCATCCACGTGGGGAATCAGACTTTTAATGATGGAACTATAGTTGAAAAGTACTTGCGAGA 
AGATACAGTACAGTCAGTTAAGCCATGGTTAACTGAAATCATGAATAATTATAAGGTTCTGA 
TCTACAATGGCCAACTGGACATCATCGTGGCAGCTGCCCTGACAGAGCGCTCCTTGATGGGC 
ATGGACTGGAAAGGATCCCAGGAATACAAGAAGGCAGAAAAAAAAGTTTGGAAGATCTTTAA 
ATCTGACAGTGAAGTGGCTGGTTACATCCGGCAAGCGGGTGACTTCCATCAGGTAATTATTC 
GAGGTGGAGGACATATTTTACCCTATGACCAGCCTCTGAGAGCTTTTGACATGATTAATCGA 
TTCATTTATGGAAAAGGATGGGATCCTTATGTTGGATAAACTACCTTCCCAAAAGAGAACAT 
CAGAGGTTTTCATTGCTGAAAAGAAAATCGTAAAAACAGAAAATGTCATAGGAATAAAAAAA 
TTATCTTTTCATATCTGCAAGATTTTTTTCATCAATAAAAATTATCCTTGAAACAAGTGAGC 
TTTTGTTTTTGGGGGGAGATGTTTACTACAAAATTAACATGAG7ACATGAGTAAGAATTACA 
TTATTTAACTTAAAGGATGAAAGGTATGGATGATGTGACACTGAGACAAGATGTATAAATGA 
AATTTTAGGGTCTTGAATAGGAAGTTTTAATTTCTTCTAAGAGTAAGTGAAAAGTGCAGTTG 
TAACAAACAAAGCTGTAACATCTTTTTCTGCCAATAACAGAAGTTTGGCATGCCGTGAAGGT 
GTTTGGAAATATTATTGGATAAGAATAGCTCAATTATCCCAAATAAATGGATGAAGCTATAA 
TAGTTTTGGGGAAAAGATTCTCAAATGTATAAAGTCTTAGAACAAAAGAATTCTTTGAAATA 
AAJ^,TATTATA.TATAAAAGTAAAAAAAAAA 
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FIGURE 6fl 

XSS.DNA3 3 206 

xsubunit 1 of 1 , 476 aa, i stop 
><?-W: 54164, pi- 5.52, NX(S/T); 4 

rWGAMV;KVIVSLVLLMPGFCDGLFRSLYRSV£iyr?PKGDSC-QPLFLT?YIEAG:<IQKaREL 
SLVGPFPGLNMKSYAGFLTVNKTYNSNLFFWFFFAQIQPEDAPWLWLQGGPGGSSMFGL 
FVEHG?yVVTSNKTLRDRDFPWTTTLSMLYIDr;PVGTGFSFTDDTHGYAVNEDDVARDLY 
SALIQFFQIFPEYXNNDFYVTGESYAGKYVPAIAKLIHSLNPVREVKINLNGIAIGDGYS 
DPESIIGGYAEFLYOIGLLDEKQKKYFQKQCKECIEHIRKQNVfFEAFEILDKLLDGDLTS 
D? S YFQN^G CSNYY^^FLRCTE PEDQLYYVKFLSLPEVRQAIHVGNQTFNDGT I VEKYLR 
TDTVQS VKP WLTE I MNNYKVL I YNGQLD 1 1 VAAALTERSLMGNIDWKGS QEYKKAE KKVWK 
I F K£ D S EVAG Y I RQ AGDFHQV 1 1 RGGGH ILP YDQPLRAFDKI NR? I YGKGVTDP YVG 
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FIGURE fil 

CGAGGGCrTTTCCGGCTCCGGAATGGCACATGTGGGAATCCCAGTCTTGTTGGCTACAACAT 
TTTTCCCT7TCCTAACAAGTTCTAACAGCTGTTCTAACAGCTAGTGATCAGGC-GTTCTTGTT 
GCTGGAGAAGAAAGGGCTGAGGGCAGAGCAGGGCACTCTCACTCAGGGTGACCAGCTCCTTG 
CCTCTCTGTGGATAACAGAGCATGAGAAAGTGAAGAGATGCAGCGGAGTGAGGTGATGGAAG 
TCTAAAATAGGAAGGAATTTTGTGTGCAATATCAGACTCTGGGAGCAGTTGACCTGGAQAGC 
CTGGGGGAGGGCCTGCCTAACAAGCTTTCAAAAfACAGGAGCGACTTCCACTGGGCTGGGAT 
AAGACGTGCCGGTAGGATAGGGAAGACTGGGTTTAGTCCTAATATCAAATTGACTGGCTGGG 
TGAACTTCAACAGCCTTTTAACCTCTCTGGGAGATGAAAACGATGGCTTAAGGGGCCAGAAA 
7AGAGATGCTTTGTAAAATAAAATTTTAAAAAAAGCAAGTATTTTATAGCATAAAGGCTAGA 
GACCAAAATAGATAACAGGATTCCCTGAACATTCCTAAGAGGGAGAAAGTATGTTAAAAATA 
GAAAAACCAAAATGCAGAAGGAGGAGACTCACAGAGCTAAACCAGGATGGGGACCCTGGGTC 
AGGCCAGCCTCTTTGCTCCTCCCGGAAATTATTTTTGGTCTGACCACTCTGCCTTGTGTTTT 




ACAGCCGCTCTGTGGTCCTGCTCTCAGTGGTCTGGGTGCTGCTGGCCCCCCCAGCAGCCGGC 
ATGCCTCAGTTCAGCACCTTCCACTCTGAGAATCGTGACTGGACCTTCAACCACTTGACCGT 




CCCCTCATCGTGCAGCCCTGCAGCGAAGTGCTCACCCTCACCAACAATGTCAACAAGCTGCT 
CATCATTGACTACTCTGAGAACCGCCTGCTGGCCTGTGGGAGCCTCTACCAGGGGGTCTGCA 
AGCTGCTGCGGCTGGATGACCTCTTCATCCTGGTGGAGCCATCCCACAAGAAGGAGCACTAC 
CTGTCCAGTGTCAACAAGACGGGCACCATGTACGGGGTGATTGTGCGCTCTGAGGGTGAGGA 




- ^ ^ ^ V- ^ xv3«.vj i i u>4tjLUAi\iL rULiAL: TATGAGCTACACAGCGAT 

TTTGTCTCCTCTCTCATCAAGATCCCTTCAGACACCCTGGCCCTGGTCTCCCACTTTGACAT 

CTTCTACATCTACGGCTTTGCTAGTGGGGGCTTTGTCTACTTTCTCACTGTCCAGCCCGAGA 

CCCCTGAGGGTGTGGCCATCAACTCCGCTGGAGACCTCrrCTACACCTCACGCATCGTGCGG 

CTCTGCAAGGATGACCCCAAGTTCCACTCATACGTGTCCCTGCCCTTCGGCTGCACCCGGGC 

CGGGGTGGAATACCGCCTCCTGCAGGCTGCTTACCTGGCCAAGCCTGGGGACTCACTGGCCC 

AGGCCTTCAATATCACCAGCCAGGACGATGTACTCTTTGCGATCTTCTCCAAAGGGCAGAAG 

CAGTATCACCACCCGCCCGATGACTCTGCCCTGTGTGCCTTCCCTATCCGGGCCATCAACTT 

GCAGATCAAGGAGCGCCTGCAGTCCTGCTACCAGGGCGAGGGCAACCTGGAGCTCAACTGGC 

TGCTGGGGAAGGACGTCCAGTGCACGAAGGCGCCTGTCCCCATCGATGATAACTTCTGTGGA 

CTGGACATCAACCAGCCCCTGGGAGGCTCAACTCCAGTGGAGGGCCTGACCCTGTACACCAC 

^^*^^^^2^^-^*^^^'^'^^^^^'^^'^^'^*^^^C:TCCTACGTTTACAACGGCTACyi^ 

TGGGGACTAAGAGTGGCAAGCTGAAAAAGGTAAGAGTCTATGAGTTCAGATGCTCCAATGCC 

ATTCACCTCCTCAGCAAAGAGTCCCTCTTGGAAGGTAGCTATTGGTGGAGATTTAACTATAG 

GCAACTTTATTTTCTTGGGGAACAAAGGTGAAATGGGGAGGTAAGAAGGGGTTAATTTTGTG 

TTTCAATATTTCCCAAACTTTAAGAAAAAACTTTAAGAAGGTACATCTGCAAAAGCAAA 
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FIGURE 62 

><S5 . DNAj555B 

xsub-^nit 1 cf 1, 552 aa, 1 stop 
xMW: 61674, pi: £.95, NX(S/T); 4 

MGTLGQASLFAPPGInTFWSDHSALCFA2£CEGQPGKVEQMSTHRSRLLTAAPLSMEQRQP 

W PRALE VDS R S VVLLS VVVrsriJ:AP P AAGMPQFS T FHSENRD WTFNHLTV^ 

INRVYKLTGNLT IQVAHKTGPEEDNKSRYFFLIVQPCSEVLTLTNNVNKLLI IDYSENRI. 

LACG S LYQGVCKLLRLDDLF I LVE PSH}CKEHYLSS VNKTGTMYGVI VRSEGEDGKLF I GT 

A'/DGKQDYFPTLSSRKLPRDPESSAMLDYELHSDFVSSLIKIPSDTLALVSHFDIFYIYG 

FASGGF/YFLTVQPETPEGVAIHSAGDLFYTSRIVRLCKDDFKFHSYVSLPFGCTRAGVE 

YRLLQAAYLAKPGDSLAQAFNITSQDDVLFAIFSKGQKQYHHPPDDSALCAFPIRAINLQ 

IKSRLQSCYQGEGNLELNWLLGKDVQCTKAPVPIDDNFCGLDINQPLGGSTPVEGLTLYT 

TSRDRMTSVASYVYI^GYSWFVGTKSGKLKKVRVYEFRCSNAIHLLSKESLLEGSYWWRF 

HYRQLYFLGSQR 
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HGURE 63 

AGGCTCCCGCGCGCGGCTGAGTGCGGACTGGAGTGGGAACCCGGGTCCCCGCGCTTAGAGAA 
CACGCGATGACCACGTGGAGCCTCCGGCGGAGGCCGGCCCGCACGCTGGGACTCCTGCTGCT 
GGTCGTCTTGGGCTTCCTGGTGCrcCGCAGGCTGGACTGGAGCACCCTGGTCCCTCTGCGGC 
TCCGCCATCGACAGCTGGGGCTGCAGGCCAAGGGCTGaAACTTCATGCTGaAGGATTCCACC 
TTCTGGATCTTCGGGGGCTCCATCCACTATTTCCGTGTGCCCAGGGAGTACTGGAGGGACCG 
CCTGCTGAAGATGAAGGCCTGTGGCTTGAACACCCTCACCACCTATGTTCCGTGGAACCTGC 
ATGAGCCAGAAAGAGGCAAATTTGACTTCTCTGGGAACCTGGACCTGGAGGCCTTCGTCCTG 
ATGGCCGCAGAGATCGGGCTGTGGGTGATTCTGCGTCCAGGCCCCTACATCTGCAGTGAGAT 
GGACCTCGGGGGCTTGCCCAGCTGGCTACTCCAAGACCCTGGCATGAGGCTGAGGACAACTT 
ACAAGGGCTTCACCGAAGCAGTGGACCTTTATTTTGACCACCTGATGTCCAGGGTGGTGCCA 
CTCCAGTACAAGCGTGGGGGACCTATCATTGCCGTGCAGGTGGAGAATGAATATGGTTCCTA 
TAATAAAGACCCCGCATACATGCCCTACGTCAAGAAGGCACTGGAGGACCGTGGCATTGTGG 
AACTGCTCCTGACTTCAGACAACAAGGATGGGCTGAGCAAGGGGATTGTCCAGGGAGTCTTG 
GCCACCATCAACTTGCAGTCAACACACGAGCTGCAGCTACTGACCACCTTTCTCTTCAACGT 
C CAGG GG ACT CAGCC CAAGATGGTGATGGAGTACTGGACGGGGTGGTTTGACTCGTGGGGAG 
GCCCTCACAATATCTTGGATTCTTCTGAGGTTTTGAAAACCGTGTCTGCCATTGTGGACGCC 
GGCTCCTCCATCAACCTCTACATGTTCCACGGAGGCACCAACTTTGGCTTCATGAATGGAGC 
CATGCACTTCCATGACTACAAGTCAQATGTCACCAGCTATGACTATGATGCTGTGCTGACAG 
AAGCCGGCGATTACACGGCCAAGTACATGAAGCTTCGAGACTTCTTCGGCTCCATCTCAGGC 

a':^ccctctccctccccc:acctgaccttcttcccaagatgccgtatgagcccttaacgccagt 

CTTGTACCTGTCTCTGTGGGACGCCCTCAAGTACCTGGGGGAGCCAATCAAG'ECTGAAAAGC 

ccatcaacatggagaacctgccagtcaatgggggaaatggacagtccttcgggtacattctc 
tatgagaccagcatcacctcgtctggcatcctcagtggccacgtgcatgatcgggggcaggt 
gtttgtgaacacagtatccataggattcttggactacaagacaacgaagattgctgtccccc 
tgatccagggttacaccgtgctgaggatcttggtggagaatcgtgggcgagtcaactatggg 
gagaatattgatgaccagcgcaaaggcttaattggaaatctctatctgaatgattcacccct 
gaaaaacttcagaatctatagcctggatatgaagaagagcttctttcagaggttcggcctgg 
A caaat ggng TT c c cr c c c agaaacacccacattacctgctttcttcttgggtagcttgtcc 

ATCAGCTCCACGCCTTGTGACACCTTTCTGAAGCTGGAGGGCTGGGAGAAGGGGGTTGTATT 

CAT CAATGG C C AGAAC CTTGGACGTTACTGGAACATTGGACCCCAGAAGACGCTTTACCTCC 

CAGGTCCCTGGTTGAGCAGCGGAATCAACCAGGTCATCGTTTTTGAGGAGACGATGGCGGGC 

CCTGCATTACAGTTCACGGAAACCCCCCACCTGGGCAGGAACCAGTACATTAAGTGAGCGGT 

GGCACCCCCTCCTGCTGGTGCCAGTGGGAGACTGCCGCCTCCTCTTGACCTGAAGCCTGGTG 

GCTGCTGCCCCACCCCTCACTGCAAAAGCATCTCCTTAAGTAGCAACCTCAGGGACTGGGGG 

CTACAGTCTGCCCCTGTCTCAGCTCAAAACCCTAAGCCTGCAGGGAAAGGTGGGATGGCTCT 

GG-CCTGGCTTTGTTGATGATGGCTTTCCTACAGCCCTGCTCTTGTGCCGAGGCTGTCGGGC 

TGTCTCTAGGGTGGGAGCAGCTAATCAGATCGCCCAGCCTTTGGCCCTCAGAAAAAGTGCTG 

A.=vACGTGCCCTTGCACCGGACGTCACAGCCCTGCGAGCATCTGCTGGACTCAGGCGTGCTCT 

TTGCTGGTTCCTGGGAGGCTTGGCCACATCCCTCATGGCCCCATTTTATCCCCGAAATCCTG 

GGTGTGTCACCAGTGTAGAGGGTGGGGAAGGGGTGTCTCACCTGAGCTGACrTTGTTCTTCC 

TTCACAACCTTCTGAGCCTTCTTTGGGATTCTGGAAGGAACTCGGCGTGAGAAACATGTGAC 

TTCCCCTTTCCCTTCCCACTCGCTGCTTCCCACAGGGTGACAGGCTGGGCTGGAGAAACAGA 

AATCCTCACCCTGCGTCTTCCCAAGTTAGCAGGTGTCTCTGGTGTTCAGTGAGGAGGACATG 

TGAGTCCTGGCAGAAGCCATGGCCCATGTCTGCACATCCAGGGAGGAGGACAGAAGGCCCAG 

CTCACATGTGAGTCCTGGCAGAAGCCATGGCCCATGTCTGCACATCCAGGGAGGAGGACAGA 

AGGCCCAGCTCACATGTGAGTCCTGGCAGAAGCCATGGCCCATGTCTGCACATCCAGGGAGG 

AGGACAGAAGGCCCAGCTCACATGTGAGTCCTGGCAGAAGCCATGGCCCATGTCTGCACATC 

CAGGGAGGAGGACAGAAGGCCCAGCTCAGTGGCCCCCGCTCCCCACCCCCCACGCCCGAACA 

GCAGGGGCAGAGCAGCCCTCCTTCGAAGTGTGTCCAAGTCCGCATTTGAGCCTTGTTCTGGG 

GCCCAGCCCAACACCTGGCTTGGGCTCACTGTCCTGAGTTGCAGTAAAGCTATAACCTTGAA 
TCACAA 
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FTGTJRE 64 

MTTWSLRRRFP_^.TL31LLLWLGFLVLRRLDWSTLV?LRL}U'MLGLQAKGWNF>1LEDS 
TFWIFGGSIHYFRVFREYWRDRLLKMKJ^CGI^JTLTTY-v^PWNIJi^ 
AFVLNl^^-^IGLWVI LR?G?y I CSEMCLGGLPSWIiQDPGKRIJlTrj^^ 
Lr^SRWFLQYKRGG? I lAVQVENE YGSYNKDPAyM?YVKKALEDRGIVELIJ:;TSDNKDG 
LSKGIVQGVIJlTINi:.QSTHELOLLTTFLFNVQGTQPKMVMEYWTGWFDSWGGPHNILDS 
SEVLKTVSAIVDAGSSINLy>;FKGGTNFGFmGAMKFHDrKSDVTSYi:TOAVLTEAGDY 
TAKYI^KLRDFFGSISGIPLPFPPDLLPKMPYSPLTPVLYLSLWDALKyLGEPIKSEKPI 
Kl^iENLP WGGNGQS FGYILYETS IT S SGILSG^r^/I^RGQVFVNTVS I GFLDYKTTKI AV 
F L I QG YT\1.RI LVEHRGRVKYGENI CDQRKGL I GNLYLNDS PIxKNFRIY SLDMKKS FFQ 
PFGLDKWXSLPETFTLPAFFLGSLSISSTPCDTFLKLSGWEKGVVFINGQNLGRYWNIG 
F OKTLYLP G P WLS S G INQV I VFEETI4AG P ALQFTETPHLGRNQY IK 
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FIGTIRF. 6^ 

GG.-.^^CAAA.^CCGGGCTCGGCCCC-AGGCCCCCAGGACCTCCATCTCCCAATGT'TGGAGGAJiTC 
CGACACGTGACGGTCTGTCCGCCGTCTCAGACTAGAGGAGCGCTGTAAACGCCATGGCTCCC 




CGTTCCGCTATGTGTCTGGCAGCCTGCACTACTTTCGGGTACCGCGGGTGCTTTGGGCCGAC 
CGGCTTTTGAAGATGCGATGGAGCGGCCTCAACGCCATACAGTTTTATGTGCCCTGGAACTA 
CCACGAGCCACAGCCTGGGGTCTATAACTTTAATGGCAGCCGGGACCTCATTGCCTTTCTGA 
ATGAGGCAGCTCTAGCGAACCTGTTGGTCATACTGAGACCAGGACCTTACATCTGTGCAGAG 
TGGGAGATGGGGGGTCTCCCATCCTGGTTGCTTCGAAAACCTGAAATTCATCTAAGAACCTC 
AGATCCAGACTTCCTTGCCGCAGTGGACTCCTGGTTCAAGGTCTTGCTGCCCAAGATATATC 
CATGGCTTTATCACAATGGGGGCAACATCATTAGCATTCAGGTGGAGAATGAATATGGTAGC 
TACAGAGCCTGTGACTTCAGCTACATGAGGCACTTGGCTGGGCTCTTCCGTGCACTGCTAGG 

^^^^?S^X^^^^'^^"^'^^ac^c:agatgggcctgaaggactcaagtgtggctccctccggg 

GACTCTATACCACTGTAGATTTTGGCCCAGCTGACAACATGACCAAAATCTTTACCCTGCTT 

cggaagtatgaaccccatgggccattggtaaactctgagtactacacaggctggctggatta 

S^;^^^SS^^Sf^^^^^^^^^^^^^^^T^TGTCAGCTGTAACCAAAGGACTAGAGA^^ 

r$^^-S^;'^S^^^^^^^^'^^^^A^^TACATGTTCCATGGAGGTACCAAOT^ 

AATGGTGCCGATAAGAAGGGACGCTTCCTTCCGATTACTACCAGCTATGACTATGATGCACC 

TATATCTGAAGCAGGGGACCCCACACCTAAGCTTTTTGCTCrrCGAGATGTCATCAGCA^^ 

;SS^2S'^^^^^^-^"^^^^^^TTTACCTCCCCCGAGCCCCAAGATGATGCTTGGA^ 

ACTCTGCACCTGGTTGGGCATTTACTGGCTTTCCTAGACTTGCTTTGCCCCCGTGGGCCCAT 

TCATTCAATCTTGCCAATGACCTTTGAGGCTGTCAAGCAGGACCATGGCTTCATGTTGTACC 

GAACCTATATGACCCATACCATTTTTGAGCCAACACCATTCTGGGTGCCAAATAATGGAGTC 

CATGACCGTGCCTATGTGATGGTGGATGGGGTGTTCCAGGGTGTTGTGGAGCGAAATATGAG 

^S^^^^T^I^-'^'^^A^^^^-GAA^CTGGGGTCCAAACTGGATATCTTGGTGGAGAACATGG 

^^^S:$^;^^?S^^^^°^^^^^^G^GTGACTTCAAGGGCCTGTTGAAGCCAC^^ 
GGGCAAACAATCCTTACCCAGTr:naTazi'rr''rT'r-r-r-'rr*T>/-7v> ^ -^r^^^^^rr,, * 




^^^-^S^'S^^^J^^^^^^GTGCCAAGATTCCTGCTGTrrCCTAGGC^ 

CATTGCTGGAACTAGAAGATGTACCTCTCCAGCCCCAAGTCCAATTTTTGGATAAGCCTATC 

?^E^'i;S^^^^^^?I^STi:^^^ACAGGACACATATC^TTCCC^ 

^■"XS:?^-2*^?^^*i^^^^'^'^^GTGGGCACTGAAAGGTAGGCCGGGC^^ 

AT^G X GGGCACCTCTAATCCCAGCTACTTGGGAGGCTGAGGGCAGGAGAATTGCTTGAATCC 
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FTGIJRE 66 

MHWSGL:.^-.IOrr/PVJNYHErQ?G^/YNF^^GSRDLIAFLNE^^^ 

EMGGLP5WLLRK?EIHLRTSDFDriJ^2iVDSVv"KVLLFKIY?WLYHNGGNIISI 

GSYRACDFSYMRHLAGLFRALLGEKILLFTTDGPEGLKCGSLRGLYTTVDFGPADNMTK 

IFTLLRKYEPHGFLWSEYYTGWLDYWGQNKSTRSVSAVTKGLEim.KLa^^VNKY^ 

GGTNFGYWNGADKKGRFLPITTSYDYDAPISSAGDPTPKLFALRDVISKFQEVPLGPLP 

PFS PKMKLGPVTLHLVGHLIAFLDLLCPRG? IHS ILPMTFEAVKQDHQFMLYRTYMTHT 

I FE P 7 ? FWPIWGVHDRAYVIWDGVFQGVVERNMRDKLF LTGKLGS KLDI LVENMGRLS 

FGSNSSDFKGLLKPPILGQTILTQWMMFPIxKIDNLVKWWFPLQLPKWPYPQAPSGPTFY 

SKTFPILGSVGDTFLYLFGWTKGQVWINGFNLGRYWTKQGPQQTLYVPRFLLFPRGALN 

'^-TLLELEDVPLQPQVQFLDKPILNSTSTLHRTKINSLSADTLSASEPMELSGH 



67 / 128 



wo 99/14328 



P<7r/US98/ 19330 



FTGTJRF. 67 

GCTTTG^Ji.CACGTCTGCAAGCCCAAAGTTGAGCATCTGATTGGTTATGAGGTATTTGAGTGC 
ACCCACAATATGGCTTACATGTTGAAAAAGCTTCTCATCAGTTACATATCCATTATTTGTGT 
TTATGGCTTTATCTGCCTCTACACTCTCTTCTGGTTATTCAGGATACCTTTGAAGGAATATT 
CTT T CG AAAAAG T C AGAG AAGAGAG CAGTTTTAGTGACATTC CAGATGTCAAAAACG ATTTT 
GCGTTCCTTCTTCACATGGTAGACCAGTATGACCAGCTATATTCCAAGCGTTTTGGTGTGTT 
CTTGTCAGAAGTTAGTGAAAATAAACTTAGGGAAATTAGTTTGAACCATGAGTGGACATTTG 
AAAAACTCAGGCAGCACATTTCACGCAACGCCCAGGACAAGCAGGAGTTGCATCTGTTCATG 
CTGTCGGGGGTGCCCGATGCTGTCTTTGACCTCACAGACCTGGATGTGCTAAAGCTTGAACT 
AATTCCAGAAGCTAAAATTCCTGCTAAGATTTCTCAAATGACTAACCTCCAAGAGCTCCACC 
TCTGCCACTGCCCTGCAAAAGTTGAACAGACTGCTTTTAGCTTTCTTCGCGATCACTTGAGA 
TGCCTTCACGTGAAGTTCACTGATGTGGCTGAAATTCCTGCCTGGGTGTATTTGCTCAAAAA 
CCTTCGAGAGTTGTACTTAATAGGCAATTTGAACTCTGAAAACAATAAGATGATAGGACTTG 
AATCTCTCCGAGAGTTGCGGCACCTTAAGATTCTCCACGTGAAGAGCAATTTGACCAAAGTT 
CCCTCCAACATTACAGATGTGGCTCCACATCTTACAAAGTTAGTCATTCATAATGACGGCAC 
TAAACTCTTGGTACTGAACAGCCTTAAGAAAATGATGAATGTCGCTGAGCTGGAACTCCAGA 
ACTGTGAGCTAGAGAGAATCCCACATGCTATTTTCAGCCTCTC7AATTTACAGGAACTGGAT 
TTAAAGTCCAATAACATTCGCACAATTGAGGAAATCATCAGTTTCCAGCATTTAAAACGACT 
GACTTGTTTAAAATTATGGCATAACAAAATTGTTACTATTCCTCCCTCTATTACCCATGTCA 
AAAACTTGGAGTCACTTTATTTCTCTAACAACAAGCTCGAATCCTTACCAGTGGCAGTATTT 
AGTTTACAGAAJICTCAGATGCTTAGATGTGAGCTACAACAACATTTCAATGATTCCAATAGA 
AATAGGATTGCTTCAGAACCTGCAGCATTTGCATATCACTGGGAACAAAGTGGACATTCTGC 
CAAA;i.CAATTGTTTAAATGCATAAAGTTGAGGACTTTGAATCTGGGACAGAACTGCATCACC 
TCACTCCCAGAGAAAGTTGGTCAGCTCTCCCAGCTCACTCAGCTGGAGCTGAAGGGGAACTG 
CTTGGACCGCCTGCCAGCCCAGCTGGGCCAGTGTCGGATGCTCAAGAAAAGCGGGCTTGTTG 
TGGAAGATCACCTTTTTGATACCCTGCCACTCGAAGTCAAAGAGGCATTGAATCAAGACATA 
AATATTCCCTTTGCAAATGGGATTTAAACTAAGATAATATATGCACAGTGATGTGCAGGAAC 
AACT7CCTAGATTGCAAGTGCTCACGTACAAGTTATTACAAGATAATGCATTTTAGGAGTAG 
ATACATCTTTTAAAATAAAACAGAGAGGATGCATAGAAGGCTGATAGAAGACATAACTGAAT 
GTTCAATGTTTGTAGGGTTTTAAGTCATTCATTTCCAAATCATTTTTTTTTTTCTTTTGGGG 
^'JLA.GGGAAGGAAAAATTATAATCACTAATCTTGGTTCTTTTTAAATTGTTTGTAACTTGGAT 
GCTGCCGCTACTGAATGTTTACAAATTGCTTGCCTGCTAAAGTAAATGATTAAATTGACATT 
TT CTTACTAAAAAAAAAAAAAAAAA 
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FIGTJRF fiS 

><££.DKA344C7 

xsuburiit 1 of 1 , 501 aa, 1 stop 
><m:: 57815, pi: 8.15, NX{S/T}: 3 

MAYMLKKLL I £ Y I S 1 1 CVYG F I CLYTLFWLFR I PLKE YS FEKVREES SFS DI PDVKNDFA 
FLLKr^TDQ'fDQLYS:<RFGVTLSEVSENKIJl2ISI2::-3WT^^ 

MLSGVPDA'/FDLTDLDVLKLELIPEAKIPAKISQMTNLQELHLCKCPAKVEQTAFSFLRD 
KLRCLH^v"<FTDVAE I ? AVmrijLKNLRELYL I GNLNSSNNKMI GLESLREIJU^ 
NLTKVPSKITDVAPKLTKLVIHNDGTKLLVLNSLKKMMWAELELQNCELERIPHAIFSL 
SNLQELDLKSNNI RTI EE 1 1 S FQHLKRLTCLKLWH^1CI VTI PPS ITHVKNLESLYFSNNK 
LES LP VA'\^SLQKIJ^CLDVS YNNI SM I P I E I GLLO^^:.QKI^I TGNXVDI LPKQLFKCI KL 
RTLK^GQKCITSLPEKVGQLSQLTQLELKGNCLDRLPAOLGQCRKLKKSGLVVEDHLFDT 
LPLEV7CSALNQDINI PFANGI 
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FIGURE m 

CCCACGCC-TCCGGCCTTCTCTCTGGACTTTGCATTTCCATTCCTTTTCATTGACAAACTGAC 

TTTTTTTATTTCTTTTTTTCCATCTCTGGGCCAGCTTGGGATCCTAGGCCGCCCTGGGAAGA 

CATTTGTGTTTTACACACATAAGGATCTGTGTTTGGGGTTTCTTCTTCCTCCCC^GACAT"G 

GCATTGCTTAGTGGTTGTGTGGGGAGGGAGACCACGTGGGCTCAGTGCTTGCTTGCACTTAT 

CTGCCTAGGTACATCGAAGTCTTTTGACCTCCATACAGTGATTATGCCTGTCATCGCTGGTG 

GTATCCTGGCGGCCTTGCTCCTGCTGATAGTTGTCGTGCTCTGTCTTTACTTCAAAATACAC 

AACGCGCTAAAAGCTGCAAAGGAACCTGAAGCTGTGGCTGTAAAAAATCACAACCCAGACAA 

GGTGTGGTGGGCCAAGAACAGCCAGGCCAAAACCATTGCCACGGAGTCTTGTCCTGCCCTGC 

AGTGCTGTGAAGQATATAGAATGTGTGCCAGTTTTGATTCCCTGCCACCTTGCTGTTGCGAC 

ATAAATGAGGGCCTCTGAGTTAGGAAAGGCTCCCTTCTCAAAGCAGAGCCCTGAAGACTTCA 

ATGATGTCAATGAGGCCACCTGTTTGTGATGTGCAGGCACAGAAGAAAGGCACAGCTCCCCA 

TCAGTTTCATGGAAAATAACTCAGTGCCTGCTGGGAACCAGCTGCTGGAGATCCCTACAGAG 

AGCTTCCACTGGGGGCAACCCTTCCAGGAAGGAGTTGGGGAGAGAGAACCCTCACTGTGGGG 

AATGCTGATAAACCAGTCACACAGCTGCTCTATTCTCACACAAATCTACCCCTTGCGTGGCT 

GGAACTGACGTTTCCCTGGAGGTGTCCAGAAAGCTGATGTAACACAGAGCCTATAAAAGCTG 

TCGGTCCTTAAGGCTGCCCAGCGCCTTGCCAAAATGGAGCTTGTAAGAAGGCTCATGCCATT 

GACCCTCTTAATTCTCTCCTGTTTGGCGGAGCTGACAATGGCGGAGGCTGAAGGCAATGCAA 

GCTGCACAGTCAGTCTAGGGGGTGCCAATATGGCAGAGACCCACAAAGCCATGATCCTGCAA 

CTCAATCCCAGTGAGAACTGCACCTGGACAATAGAAAGACCAGAAAACAAAAGCATCAGAAT 
rArCTTTTCCTATGTCCAGCTTniiTri-iinRTrv^A 7Nr!^oPT.^-n ■« , , , ™™ 




"^^^^ A wj.vjui>i\jrt^L-AAL»iUiULJ\UTAAAAAC^ 

CCTGTATTTGAATCATCATCCAGTACATTGACGTTTCAAATAGTTACTGACTCAGCAAGAAT 
TCAAAGAACTGTCTTTGTCTTCTACTACTTCTTCTCTCCTAACATCTCTATTCCAAACTGTG 
GCGGTTACCTGGATACCTTGGAAGGATCCTTCACCAGCCCCAATTACCCAAAGCCGCATCCT 




. . ^ v.^^^^ ^ i V i ^i^s^ i ^ i 1 U i-rCiTCrACAGATTATGCCAATTCTTACCGGGGATT 
TTCTGCTTCCTACACCTCAATTTATGCJ^GAAAACATCAACACTACATCTTTAAC^GCTCTT 
CTGACAGGATGAGAGTTATTATAAGCAAATCCTACCTAGAGGCTTTTAACTCTAATGGGAAT 
AACTTGCAACTAAAAGACCCAACTTGCAGACCAAAATTATCAAATGTTGTGGAATTTTCTGT 

CCCTCTTAATGGATGTGGTArART^•ar:atn^^r!T>7^r!T^^^r^l^rro-^^rT,^, ,m,T,,^m^ 




CTCT 

ATGAAACT 



AAATTCTTGAGAAGTATGAGCTCTGTGTATCTGCAGTGTAAAGTTTTGATATGTGATAGCAG 

ATAAATGGAAAACAGATTCCATCATAGGACCCATTCGTCTGAAAAGGGATCGAAGTGCaAGT 

GGCAATTCAGGATTTCAGCATGAAACACATGCGGAAGAAACTCCAAACCAG^Cm 

TGTGCATCTGTTTTCCTTCATGGTTCTAGCTCTGAATGTGGTGACTOTA^CGACAAT^ 
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><;SS .DNA3 5B41 

xsubunir I ot 1 , 607 aa, 1 stop 

xMW: 68153, p.: 6.33, NXtS/T): 5 

MS V/R.^IIJ^?LTLL ILS CIAELTMJiEAE GNAS CTVS LGGAJ^^ 

T I ER?SK~<S I RI I FS YVQLDPDGS CESEKI KyFDGTSSNGPLLGQVCS KNDYVPVrES S S 
STLTr Q I VTDS ARI QRT V? VFYYF F S FNI S I PNCGGYLDTLEGSFTS PNYPKPHPELAYC 
Vl\>: I QVEKDYKI KIJ^KE I FLE I DKC CKFDFLAI YDGPSTNSGLI GQVCGRVTPTFE S SS 
KSLTVv^STDYAKSYRGFSAS YTS I YAENINTTSLTCSSDRMRVI ISKSYLEAFNSNGNN 
LQLKDPTCRPXLSNWEFSVFLNGCGTIRKVEDQS ITYTNI ITFSASSTSEVITRQKQLQ 
1 1 VKCEMGKNSTVE 1 1 YITEDDVI QSQKALGKYNTSMALFESNSFSKTILESPYYVDLNQ 
TLFVQVSIjrrSDPNL-VVTLDTCRASPTSDFii^PTYDLIKSGCSRDSTCIC^PIiFGHYGRF 
QFNAFKr LRSMS SVYLQCKVLI CD SSDHQSRCNQGCVSRSKRDI SS YKWKTDS 1 1 CP IRL 
KRDRS AS GNS GF QHSTHAEET PNQ P FN SVHLF S FMVLALNVVTVAT I TVRKF VNQRAD YK 
YQKLQlTf 
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nGTJR E 71 

GGrtCATGCGGCCCCAGGAGCTCCCCAGGCTCGCGTTCCCG"TGCTGCTGTTGC'^GTTGCTGC 

TGCTGCCGCCGCCGCCGTGCCCTGCCCACAGCGCCACGCGCTTCGACCCCACCTGGGAGTCC 

CTGGACGCCCGCCAGCTGCCCGCGTGGTTTGACCAGGCCAAGTTCGGCATCTTCATCCACTG 

GGGAGTGTTTTCCGTGCCCAGCTTCGGTAGCGAGTGGTTCTGGTGGTATTGGCAAAAGGAAA 

AGATACCGAAGTATGTGGAATTTA.TGAAAGATAATTACCCTCCTAGTTTCAAATATGAAGAT 

TTTGGACCACTATTTACAGCAAAATTTTTTAATGCCAACCAGTGGGCAGATATTTTTCAGGC 

CTCTGGTGCCAAATACATTGTCTTAACTTCCAAACATCATGAAGGCTTTACCTTGTGGGGGT 

CAGAATATTCGTGGAACTGGAATGCCATAGATGAGGGGCCCAAGAGGGACATTGTCAAGGAA 

CTTGAGGTAGCCATTAGGAACAGAACTGACCTGCGTTTTGGACTGTACTATTCCCTTTTTGA 

ATGGTTTCATCCGCTCTTCCTTGAGGATGAATCCAGTTCArrCCATAAGCGGCAATTTCCAG 

TTTCTA.J^GACATTGCCAGAGCTCTATGAGTTAGTGAACAACTATCAGCCTGAGGTTCTGTGG 

TCGGATGGTGACGGAGGAGCACCGGATCAATACTGGAACAGCACAGGCTTCTTGGCCTGGTT 

ATATAATGAAAGCCCAGTTCGGGGCACAGTAGTCACCAATGATCGTTGGGGAGCTGGTAGCA 

TCTGTAAGCATGGTGGCTTCTATACCTGCAGTGATCGTTATAACCCAGGACATCTTTTGCCA 

CATAAATGGGAAAACTGCATGACAATAGACAAACTGTCCTGGGGCTATAGGAGGGAAGCTGG 

AATCTCTGACTATCTTACAATTGAAGAATTGGTGAAGCAACTTGTAGAGACAGTTTCATGTG 

GAGGAJU^TCTTTTGATGAATATTGGGCCCACACTAGATGGCACCATTTGTGTAGTTTTTGAG 

GAGCGACTGAGGCAAGTGGGGTCCTGGCTAAAAGTCAATGGAGAAGCTATTTATGAAACCTA 

TACCTGGCGATCCCAGAATGACACTGTCACCCCAGATGTGTGGTACACATCCAAGCCTAAAG 

AAAAATTAGTCTATGCCATTTTTCTTAAATGGCCCACATCAGGACAGCTGTTCCTTGGCCAT 

CCCAAAGCTATTCTGGGGGCAACAGAGGTGAAACTACTGGGCCATGGACAGCCACTTAACTG 

GATTTCTTTGGAGCAAAATGGCATTATGGTAGAACTGCCACAGCTAACCATTCATCAGATGC 
CGTGTAAATGGGGCTGGGCTrTAr-nr^Ta ^o^t^ n^r^n^r^y^JZ^^^t^^^ ^^'Ztt^^ 



GT^^^.G-AAAAATACTGTAAAATAAATGGTGCACCTGTATAGGGCACTTACCACGAATGGAG 

cttacaggactgg,aagttgctctgggtgagtc:agtgagtgaatgtgaaggcct^^ 



tttaaaacc 
atgtaa 
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FIGURE 72 

M^?OEL?RLArPLLLLLLLLIj????C?AHSATRrDFTWTSLDARQLPAWFDQAK?G^ 
HWCTv r S V? S FGS2WFWWTWQKEKI PK'rvi FKJa}NYFP S FiaTDFG PLFTAKFFTTAI^QWA 

d i fqas gaky i vt^tskrz-^igftlwgshys wkvznai degpkrd i vkelevai rnrtdlrf 
glyyslfewfi^tlflfdesssfkkrqfpvsktlpelyelvnnyqpevlwsdgdggapdq 
ywnstgfiawlynespvkgtvvt^T)rwgagsickhggfytcsdrynpghllph:^ 
tidklswgyrheagisdyltieelvxqlvftvscggnllmnigptldgtiswfezrlr 

QVGSWLKVI^'GEAIYETYTWRSQNDTVTPDVWYTSKPKEKLVYAXFLK^^ 
KAILGArFVKLLGHGQPLNW:SLSQNGIMVSLPQLTIHQMPCKWGWALJa.TNVI 
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FIGURE 73 

AGCAGGGAAATCCGGATGTCTCGGTTATGAAGTGGAGCAGTGAGTGTGAGCCTCAACATAC-T 
'^S-iS^-F^I^^^^'^^^^^^-TA-TTATTGAGCATCTGCCTCTCATATCACCAGTGGCCAT^ 
TGA'j^'TGiTTCCCTGGCTCTGAAGGGGTAGGCACGATGGCCAGGTGCTTCAGCCTGGTGTT- 
CTTGTCACTTCCATCTGGACCACGAGGCTCCTGGTCCAAGGCTCTTTGCGTGCAGAAGAGCT 
TTCCATCCAGGTGTCATGCAGAATTATGGGGATCACCCTTGTGAGCAAJU^JLGGCGAACCAGC 
AGGTGAATTTCACAGAAGCTA-AGGAGGCCTGTAGGCTGCTGGGACTAAGTTTGGCCGGCAAG 
GACCAAGTTGAAACAGCCTTGAAAGCTAGCTTTGAAACTTGCAGCTATGGCTGGGTTGGAGA 
TGGATTCGTGGTCATCTCTAGGATTAGCCCAAACCCCAAGTGTGGGAAAAATGGGGTGGGTG 
TCCTGATTTGGAJi.GGTTCCAGTGAGCCGACAGTTTGCAGCCTATTGTTACAACTCATCTGAT 
ACTTGGACTAACTCGTGCATTCCAGAAATTATCACCACCAAAGATCCCATATTCAACACTCA 
AACTGCAACACAAACAACAGAATTTATTGTCAGTGACAGTACCTACTCGGTGGCATCCCCTT 
ACTCTACAATACCTGCCCCTACTACTACTCCTCCTGCTCCAGCTTCCACTTCTATTCCACGG 
AGAAJIAAAATTGATTTGTGTCACAGAAGTTTTTATGGAAACTAGCACCATGTCTACAGAAAC 
TGAACCATTTGTTGAAAATAAAGCAGCATTCAAGAATGAAGCTGCTGGGTTTGGAGGTGTCC 
CCACGGCTCTGCTAGTGCTTGCTCTCCTCTTCTTTGGTGCTGCAGCTGGTCTTGGATTTTGC 
TATGTCAAAAGGTATGTGAAGGCCTTCCCrrTTTACAAACAAGAATCAGCAGAAGGAAATGAT 
CGAAJ^CCAAAGTAGTAAAGGAGGAGAAGGCCAATGATAGCAACCCTAATGAGGAATCAAAGA 
AAACTGATAAAJ^CCCAGAAGAGTCCAAGAGTCCAAGCAAAACTACCGTGCGATGCCTGGAA 
GCTGAAGTTTAGATGAGACAGAAATGAGGAGACACACCTGAGGCTGGTTTCTTTCATGCTCC 
TTACCCTGCCCCAGCTGGGGAAATCAAAAGGGCCAAAGAACCAAAGAAGAAAGTCCACCCTT 
GG-TCCTAACTGGAATCAGCTCAGGACTGCCATTGGACTATGGAGTGCACCAAAGAGAATGC 
CCTTCTCCTTATTGTAACCCTGTCTGGATCCTATCCTCCTACCTCCAAAGCTTCCCACGGCC 
TTTCTAGCCTGGCTATGTCCTAATAATATCCCACTGGGAGAAAGGAGTTTTGCAAAGTGCAA 
GGACCTAAAACATCTCATCAGTATCCAGTGGTAAAAAGGCCTCCTGGCTGTCTGAGGCTAGG 
TGGGTTGAAAGCCAAGGAGTCACTGAGACCAAGGCTTTCTCTACTGATTCCGCAGCTCAGAC 
CCTTTCTTCAGCTCTGAAAGAGAAACACGTATCCCACCTGACATGTCCTTCTGAGCCCGGTA 
AGAGCAAAAGAATGGCAGAAAAGTTTAGCCCCTGAAAGCCATGGAGATTCTCATAACTTGAG 
ACCTA.^TCTCTGTAAAGCTAAAATAAAGAAATAGAACAAGGCTGAGGATACGACAGTACACT 
GTCAGCAGGGACTGTAAACACAGACAGGGTCAAAGTGTTTTCTCTGAACACATTGAGTTGGA 
ATCACTGTTTAGAACACACACACTTACTTTTTCTGGTCTCTACCAGTGCTaATATTTTCTCT 
AGGAAATATACTTTTACAAGTAACAAAAATAAAAACTCTTATAAATTTCTATTTTTATCTGA 
GTTACAGA^TGATTACTAAGGAAGATTACTCAGTAATTTGTTTAAAAAGTAATAAAATTCA 
ACAAACATTTGCTGAATAGCTACTATATGTCAAGTGCTGTGCAAGGTATTACACTCTGTAAT 




^....^^.^^^^^ V- x/iv.-^ 1 ^/ii uiAU i'TCL'AAACTAATTTTTATTTTTGCTGAGACTAATCTT 
ATTCATTTTCTCTAATATGGCAACCATTATAACCTTAATTTATTATTAACATACCTAAGAAG 
TACATTGTTACCTCTATATACCAAAGCACATTTTAAAAGTGCCATTAACAAATGTATCACTA 

AGCA-T-^AGiSAA^^ 
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FIGUR E 74 

><S£ .DHA34:4:31 

xsubur.it 1 cf 1, 3 22 aa, 1 stop 
><:4;v: 35213, pi: 8.71, NX {£/T) : 3 

MAHCFSlVLLLTSIWTTRIuLVQGSLRAEELSIQVSCRIMGITLVSKKANQQLNFTELz^^ 
CRLLGLSLAGKDQVTT;iJ.:<A£FETCSYGWVGDGFVVISRISP^'?KCGKNGVGVLIWKVPV 
S RQ FAAY C YNS S DTWTNS CI PE 1 1 TTKD F I FNTQTATQTTE? I VS DS TY S VAS F YS T I F A 
PTTTF FAFASTS I PRRKKLI CVTEVFMETSTMSTETHPrVENKAAFKKEAA^^^ 
LVLALLrFGAAAGLGFCYVKRYVKAFPFTNKNQQKEMIETKVVKE5KANDSNP^^ 
DKNFEESKSFSKTTAmCLEAEV 
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FIGURE 7.S 

AG 

><M£T {Lran£ = l-s, cir=f, rGS=^l}> 

ATGGCGGTCTTGGCACCTCTAATTGCTCTCGTG-ATTCGGTGCCGCGACTTTCACGATGG 
CTCGCCCA;i.CCTTACTACCTTCTGrCGGCCCTGCTCTCTGCTGCCTTCCTACTCGTGAGG 
A^ACTGCCGCCGCTCTGCCACGGrCTGCCCACCCAACGCGA^.GACGGTAACCCGTGTGAC 
TTTGACTGGAGAGJi-AGTGGAGATCCTGATGTTTCTCAGTGCCi^rTGTGATGATGAAGAAC 
CGCAGATCCATCACTGTGGAGCAACATATAGGCAACATTTTCATGTTTAGTAAAGTGGCC 

;^acaca;^.ttcttttcttccggttggatattcgcatgggcctactttacatcacactctgc 

ATAGTGTTCCTGATGACGTGCAJiACCCCCCCTATATATGGGCCCTGAGTATATCAAGTAC 
TTCAJi.TGATAJiJUi.CCA.TTGATGAGGAACTAGAACGGGACAAGAGGGTCACTTGGATTGTG 
GAGTTCTTTGCC^ATTGGTCTAATGACTGGCAATCATTTGCCCCTATCTATGCTC-ACCTC 

tccctta;^-;^.taca?.ctgtacagggctaaattttgggaaggtggatgttggacgctatact 

G^TGTTAGTACGCGGTACAAAGTGAGCACATCACCCCTCACCAAGCAACTCCCTACCCTG 
ATCCTGTTCCAJi.GGTGGCJiJlGGAGGCAATGCGGCGGCCACAGATTGACAAGAAAGGACGG 
GCTGTCTCATGGACCTTCTCTGAGGAGAATGTGATCCaAGAATTTAACTTAAATGAGCTA 
TACCAGCGGGCCAAGAAACTATCAAAGGCTGGAGACAATATCCCTGAGGAGCAGCCTGTG 
GC7TCAACCCCCACCACAGTGTCAGATGGGGAAAACAAGAAGGATAAATAAGATCCTCAC 
7TTGGCAGTGCTTCC7CTCCTGTCAATTCCAGGCTCTTTCCATAACCACAAGCCTGAGGC 
TGCAGCCTTTNA7TNATGTTTTCCCTTTGGCTGNGACTGGNTGGGGCAGCATGCAGCTTC 
TGATTTTA.AAGAGGCATCTAGGGA-ATTGTCAGGCACCCTACAGGAAGGCCTGCCATGCTG 
TGGCCAACTGTTTCACTGGAGCA-AGAAAGAGATCTCATAGGACGGAGGGGGAAATGGTTT 
CCCTCC.^iJ^.GCTTGGGTCAGTGTGTTAACTGCTTATCAGCTATTCAaACATCTCCATGGTT 
rr^CCATGAAACrCTGTGGTTTCATCATTCCTTCTTAGTTGACCTGCACAGCTTGGTTAG 
ACCTAGATTTAACCCTAAGGTAAGATGCTGGGGTATAGAACGCTAAGAATTTTCCCCCAA 
GGACTCTTGGTTCCTTAAGCCCTTCTGGCTTCGTTTATGGTCTTCATTAAAAGTATAAGC 
-TAArTTTGTCGCTAGTCCTAAGGAGAAACCTTTAACCACAAAGTTTTTATCATTGAAGA 
C.A^-TATTGAACAAGCCCCTATTTTGTGGGGATTGAGAAGGGGTGAATAGAGGCTTGAGAC 
TTTCCTTTGTGTGGTAGGACTTGGAGGAGAAATCCCCTGGACTTTaACTAACCCTCTGA.C 
ATACTGCGCAGACCCAGTTGATGGCT7TCCGTAA7AAAAAGATTGGGATTTCCTTTTG 
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FIGURE 76 

M.KVIJ^.PlIAI\^iSV?RLSRViLAQ?YYLLSALLSA;iTLLVRKLPFLCH3LPTQ 

rDWR£VEIU•:^LSAI^/^::'IK^;RRSIrvTQHIGK:^:<?SK\^ANTILF^RLDIR^IGLLYI^ 

IVFI_MTCKF?lY:-:C-?EYIKYFNDKTIDE£LERDKRVTWIVEFFANsflSNDCQSFAPIY 

SLKYNCTGLNFGKVDVGRYTDVSTRYKV'STSPLTKQLPTLILFCGGKEai^RRPQIDK^ 

AVSWTF5£E^^/IREFMLNELYQRAKKLSKAGDNI?EE0PVASTrTTVSDGENKKDK 
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FTQTJRF 77 

GGACAGCrCGCC-GCCCCCGAC-AGCTCTAGCCGTCGAGGAGCTGCCTGGGGACGTTTGCCCTG 
GGGCCCCAGCGTGGCCCGGGTCACCCTGGCATGAGGAGATGGGCCTGTTGCTCCTGGTCCCA 
TTGCTCCTGCTGCCCGGCTCCTACGGACTGCCCTTCTACAACGG'GTTCTA.CTACTCCAACAG 
CGCCAACGACCAGAACCTAGGCAACGGTCATGGCAAAGACCTCCTTAATGGAGTGAAGCTGG 
TGGTGGAGACACCCGAGGAGACCCTGTTCACCTACCAAGGGGCCAGTGTGATCCTGCCCTGC 
CGC7ACCGCTACGAGCCGGCCCTGGTCTCCCCGCGGCGTGTGCGTGTCAAATGGTGGAAGCT 
GTCGGAGA-ACGGGGCCCCAGAGAAGGACGTGCTGGTGGCCATCGGGCTGAGGCACCGCTCCT 
TTGGGGACTACCAAGGCCGCGTGCACCTGCGGCAGGACAAAGAGCATGACGTCTCGCTGGAG 
ATCCAGGATCTGCGGCTGGAGGACTATGGGCGTTACCGCTGTGAGGTCATTGACGGGCTGGA 
GGATGAAAGCGGTCTGGTGGAGCTGGAGCTGCGGGGTGTGGTCTTTCCTTACCAGTCCCCCA 
ACGGGCGCTACCAGTTCAACTTCCACGAGGGCCAGCAGGTCTGTGCAGAGCAGGCTGCGGTG 
G7GGCCTCCTTTGAGCAGCTCTTCCGGGCCTGGGAGGAGGGCCTGGACTGGTGCAACGCGGG 
CTGGCTGCAGGATGCTACGGTGCAGTACCCCATCATGTTGCCCCGGCAGCCCTGCGGTGGCC 
CAGGCCTGGCACCTGGCGTGCGAAGCTACGGCCCCCGCCACCGCCGCCTGCACCGCTATGAT 
GTATTCTGCTTCGCTACTGCCCTCAAGGGGCGGGTGTACTACCTGGAGCACCCTGAGAAGCT 
GACGCTGACAGAGGCAAGGGAGGCCTGCCAGGAAGATGATGCCACGATCGCCAAGGTGGGAC 
AGCTCTTTGCCGCCTGGAAGTTCCATGGCCTGGACCGCTGCGACGCTGGCTGGCTGGCAGAT 
GGCAGCGTCCGCTACCCTGTGGTTCACCCGCATCCTAACTGTGGGCCCCCAGAGCCTGGGGT 
CCGAAGCTTTGGCTTCCCCGACCCGCAGAGCCGCTTGTACGGTGTTTACTGCTACCGCCAGC 
ACTAGGACCTGGGGCCCTCCCCTGCCGCATTCCCTCACTGGCTGTGTATTTATTGAGTGGTT 
CGTTTTCCCTTGTGGG7TGGAGCCATTTTAACTGTTTTTATACTTCTCAATTTAAATTTTCT 
TTAAACATTTTTTTACTATTTTTTGTAAAGCAAACAGAACCCAATGCCTCCCTTTGCTCCTG 
GATGCCCCACTGCAGGAATCATGCTTGCTCCCCTGGGCCATTTGCGGTTTTGTGGGCTTCTG 
GAGGQTTCCCCGCCATCCAGGCTGGTCTCCCTCCCTTAAGGAGGTTGGTGCCCAGAGTGGGC 
GGTGGCCTGTCTAGAA.TGCCGCCGGGAGTCCGGGCATGGTGGGCACAGTTCTCCCTGCCCCT 
CAGCCTGGGGGAAGAAGAGGGCCTCGGGGGCCTCCGGAGCTGGGCTTTGGGCCTCTCCTGCC 
CACCTCTACTTCTCTGTGAAGCCGCTGACCCCAGTCTGCCCACTGAGGGGCTAGGGCTGGAA 
GCCAGTTCTAGGCTTCCAGGCGAAATCTGAGGGAAGGAAGAAACTCCCCTCCCCGTTCCCCT 
TCCCCTCTCGGTTCCAAAGAATCTGTTTTGTTGTCATTTGTTTCTCCTGTTTCCCTGTGTGG 
GGAGGGGCCCTCAGGTGTGTGTACTTTGGACAATAAATGGTGCTATGACTGCCTTCCGCCAA 
AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
A-WVAJJW^AAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
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FIGUR E 78 

><ss .DNA3 94 2J 

><s^-bunit 1 of 1, 360 aa, i stop 
xMVJ: 40854, pi: 6.44, NX (S/T) : 0 

r-lGLL:.liV?LLlLFGSYGLP~YNGFYYSNSANDQNa.GNGHG?3IJ:A'GVKL^VET?ZETL?T 

yqgasvi:.pcryryz?alvs?rrwvkwwklsengape:<dvlvaiglj^^^ 
lrqdkzhdvs le i qdlrled ygr yrcevi dgledes glvelelrg wfpyqs pngryq fn 
?hegqqvcaeqaawa£feclfrawsegldwcnagwlqdatvqy?iml?rqpcggfgla? 
g^/ks yg ? rhrrlihr yd vf cf at al kgrvyyleh pe kltlte are acqedd at i akvgql f 
aavc<fhgij:rcdagwt:jij)gs^/ryfvvh?hpncgppepgvr£fgf?d?qs^ 
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FIQTJRF 79 

GGAGAGCGGAGCGAAGCTGGATAACAGGGGACCG 
xMHT {trans = l-s, dir = f, res = l} 

AT GATGTGG CG AG CAT CAGTTGTG C7G CrxCTGTTGC i ACTGAGG GAGGGGGCCCAGGGG 
AAGCCATCCCCAGACGCAGGCCCTCATGGCCAC-C-GGAGGGTGCACCAGGCGGGCCCCCTG 
AGCGACG'GTCCCCATGATGACGCCCACGGGAACTTCCAGTACaACCATGAGGCTTTCCTG 
GGACGGGAAGTGGC'CAAGGAATTCGACCAACTCACCCCAGAGGAAAGCCAGGCCCGTCTG 
GGGCGGATCGTGGACCGCATGGACCGCGCGGGGGACGGCGACGGCTGGGTGTCGCTGGCC 
GAGCTTCGCGCGTGGATCGCGCACACGCAGCAGCGGCACATACGGGACTCGGTGAGCGCG 
GCCTGGGACACGTACGACACGGACCGCGACGGGCGTGTGGGTTGGGAGGAGCTGCGCAAC 
GCCACCTATGGCCACTACGCGCCCGGTGAAGAATTTCATGAGGTGGAGGATGCAGAGACC 
TACAAAAAGATGCTGGCTCGGGACGAGCGGCGTTTCCGGGTGGCCGACCAGGATGGGGAC 
TCGATGGCCACTCGAGAGGAGCTGACAGCCTTCCTGCACCCCGAGGAGTTCCCTrCACATG 
CGGGACATCGTGATTGCTGAAACCCTGGAGGACCTGGACAGAAACAAAGATGGCTATGTC 
CAGGTGGAGGAGTACATCGCGGATCTGTACTCAGCCGAGCCTGGGGAGGAGGAGCCGGCG 
TGGGl'GCAGACGGAGAGGCAGCAGTTCCGGGACTTCCGGGATCTGAACAAGGATGGGCAC 
CTGGATGGGAGTGAGGTGGGCCACTGGGTGCTGCCCCCTGCCCAGGACCAGCCCCTGGTG 
GAAGC'GAACCACCTGCTGCACGAGAGCGACACGGACAAGGATGGGCGGCTGAGCAAAGCG 
GAAATCCTGGGTAATTGGAACATGTTTGTGC-GCAGTCAGGCCACCAACTATGGCGAGGAC 
CTGACCCGGCACCACGATGAGCTGTGAGCACCGCGCACCTGCCACAGCCTCAGAGGCCCG 
CACAATGACCGGAGGAGGGGCCGCTGTGGTCTGGCCCCCTCCCTGTCCAGGCCCCGCAGG 
AGGCAGATGCAGTCCCAGGCATCCTCCTGCCCCTGGGCTCTCAGGGACCCCCTGGGTCGG 
CTTCTGTCCCTGTCACACCCCCAACCCCAGGGAGGGGCTGTCATAGTCCCAGAGGATAAG 
CAATACCTATTTCTGACTGAGTCTCCCAGCCCAGACCCAGGGACCCTTGGCCCCAAGCTC 
AGCTCTAAGAACCGCCCCAACCCCTCCAGCTCCAAATCTGAGCCTCCACCACATAGACTG 
AAACTCCCCTGGCCCCAGCCCTGTCCTGCCTGGCCTGGCCTGGGACACCTCCTCTCTGCC 
AGGAGGCAATA^JIGCCAGCGCCC-GGACCTTGAAAAAAAAAAAAAAAAAAAAAAAAAAAA 
AAAAA^JLA^JiAJ^AAAAAJiJVAAJiAAAAAAA 
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FIfflJKE 80 

X / u s r / 3 e qdi: 2 / s 5 1 / DNA/Dr,a seqs.min/ss. DNA4 06 20 
>< sub-nit: 1 of 1, 32 E aa, C stop 
:.<m: 3 7433, pi: 4.77, ^^(S/TJ: 1 

MMWHPSVLLLLLLLRHGAQGKPSPDAGPHGQGRVHQAAFLSDAFKDDAHGNFQYDHEAFL 
GRI:VA:<EF3QLTPEZ3QARLGRIVDRMDRAGDGDGWSLAELRAWIAHTQQRHIRDSVSA 
AWDTYOTDRDGRVGWHz:LRKATTG^r^APGEEFKDVEDAETYKK>IlJlRDE^ 
SI-lATRZELTAFLHPSZPPPJ^RDIVIA-ETLEBLDRNKDGYVQVEEYIADLySAEPGEEEPA 
vr^QTE RQQFRD FRDIiNKDGKLDGS E VGHWVL F P AQDQ PLVEANHLLKE SDTDKDGR^ 
E I LGN'W^'Mrv G3 QATNYGSDLTRHHDEL 
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FIGURE 81 

C-GGGCCTTGCCT7CCGCACTCQG3CC-CAGCCGGGTGGATCTCGAGCAGGTGCGGAGCCCC 

GGGCGGCGGGGGCGGGTGCGAGGGATCCCTGACGCCTCTGTCCCTGTTTCTTTGTCGCTC 

CCAGCCTGTCTGTCGTCGTTTTGGCGCCCCCGCCTCCCCGCGGTGCGGGGTTGCACACCG 

ATCCTGGGCT7CGCTCGATTTGCCGCCGAGGCGCCTCCCAGACCTAGAGGGGCGCTGGCC 

TGGAGCAGCGC-C-TCGTCTGTGTCCTCTCTCCTCTGCGCCGCGCCCGGGGATCCGAAGGG'^ 

GCGGGGCTCTGAGGAC-G7GACGCGCGGGGCCTCCCGCACCCTGGCCTTGCCCGCATTCTC 

CCTCTCTCCCAGGTGTGA.GCAGCCTATCAGTCACC 

xMET (trans = l-s, dir=f, res = l} 

ATGTCCGCAGCCTC-GATCCCGGCTCTCGGCCTCGGTGTGTGTCTGCTGCTGCTGCCGGGG 
CCCGCGGGCAGCGAGGGAGCCGCTCCCATTGCTATCACATGTTTTACCAGAGGCTTGGAC 
ATCAGGAJiJlGAGAAJVGCAGATGTCCTCTGCCCAGGGGGCTGCCCTCTTGAGGAATTCTCT 
GTGTATGGGAACATAGTATATGCTTCTGTATCGAGCATATGTGGGGCTGCTGTCCACAGG 
GGAGTA-ATCAGCAJ^CTCAGGGGGACCTGTACGAGTCTATAGCCTACCTGGTCGAGAAAAC 
TATTCCTCAGTAGATGCCAATGGCATCCAGTCTCAAATGCTTTCTAGATGGTCTGCTTCT 
TTCACAGTAACTAAAGGCAAAAGTAGTACACAGGAGGCCACAGGACAAGCAGTGTCCACA 
GCACATCCACCAACAGGTAAACGACTAAAGAAAACACCCGAGAAGAAAACTGGCAATAAA 
GATTGTAAAGCA.GACATTGCATTTCTGATTGATGGAAGCTTTAATATTGGGCAGCGCCGA 
TTT.AATTTACAGAAGAATTTTGTTGGAAAAGTGGCTCTAATGTTGGGAATTGGAACAGAA 
GGACCACATGTGGGCCTTGTTCAAGCCAGTGAACATCCCAAAATAGAATTTTACTTGAAA 
AACTTTACATCAGCCAAAGATGTTTTGTTTGCCATAAAGGAAGTAGGTTTCAGAGGGGGT 
.AATTCCAATACAGGAJ^GCCTTGAAGCATACTGCTCAGAAATTCTTCACGGTAGATGCT 
GGAGTAAGAAJ^GGGA-TCCCCAAAGTGGTGGTGGTATTTATTGATGGTTGGCCTTCTGAT 
GACATCGAGGAAGCAGGCATTGTGGCCAGAGAGTTTGGTGTCAATGTATTTATAGTTTCT 
GTGGCCAJIGCCTATCCCTGAAGAACTGGGGATGGTTCAGGATGTCACATTTGTTGACAAG 
GCTGTCTGTCGGAATAJkTGGCTTCTTCTCTTACCACATGCCCAACTGGTTTGGCACCACA 
AAATACGTAAAGCCTCTGGTACAGAAGCTGTGCACTCATGAACAAATGATGTGCAGCAAG 
ACCTGTTATAACTCAGTGAACATTGCCTTTCTAATTGATGGCTCCAGCAGTGTTGGAGAT 
AGCAATTTCCGCCTCATGCTTGAATTTGTTTCCAACATAGCCAAGACTTTTGAAATCTCG 
GACATTGGTGCCAAGATAGCTGCTGTACAGTTTACTTATGATCAGCGCACGGAGTTCAGT 
TTCACTGACTATAGCACCAJ^GAGAATGTCCTAGCTGTCATCAGAAACATCCGCTATATG 
A3TGGTGGAACAGCTACTGGTGATGCCATTTCCTTCACTGTTAGAAATGTGTTTGGCCCT 
ATAAGGGAGAGCCCCAA.CAAGAACTTCCTAGTAATTGTCACAGATGGGCAGTCCTATGAT 
GATGTCCAJ^GGCCCTGCAGCTGCTGCACATGATGCAGGAATCACTATCTTCTCTGTTGGT 
GTGGCTTGGGCACCTCTGGATGACCTGAAAGATATGGCTTCTAAACCGAAGGAGTCTCAC 
GCTTTCTTCACAAGAGAGTTCACAGGATTAGAACCAATTGTTTCTGATGTCATCAGAGGC 
A . TTGTAGAGATTTCTTAGAATCCCAGCAATAATGGTAACATTTTGACAACTGAAAGAAA 
A^AGTACAJIGGGGATCCAGTGTGTAAATTGTATTCTCATAATACTGAAATGCTTTAGCATA 
CTAGAATCAGATACAJ^JICTATTAAGTATGTCAACAGCCATTTAGGCAAATAAGCACTCC 
TTTAAJIGCCGCTGCCTTCTGGTTACAATTTACAGTGTACTTTGTTAAAAACACTGCTGAG 
GCTTCATAATCATGGCTCTTAGAAACTCAGGAAAGAGGAGATAATGTGGATTAAAACCTT 
AAGAGTTCTAJlCCATGCCTACTAAATGTACAGATATGCAAATTCCATAGCTCAATAAAAG 
AA-TCTGATACTTAGACCAAAJiAAAAAAA 
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FTGTJRE 82 

x/usr/se qdt)2 / s £ t / DKA/ Dna s e qs . rr.i n / s s . 0 504 
xsubunit 1 of 1, 5 50 aa, 0 step 
xIvW: 594 33, pi: 8.34, NX (S/T) : 2 

M£AAV7I?ALGLGVCLLLL?GPAGSEGAAPIAITCFTRGU3IRKEKADVLCPGGCPLEEFS 

VYGNI VYASVSS I CGAA'J>:RGVISKSGG?VRVYSLFGRENYSSVr)ANGIQSQMLSRWSAS 

Fr^^^:<GKSSTQHATGQAVSTAHPPTG:<RLKKTPEKKTGNKDCXADIAFLIDGSFKIGQRR 

FNLQKNFVGKVALT^GIGTEGPKVGLVQASEHFKIEFYLKNFTSAKDVLFAIKEVGFRGG 

NSNTGKALKHTAQKFrTVDAGVRKGIPKVVWFIDGWPSDDIHEAGIV;^ 

VAiC? : PEELGrf /QDVTFVDKAVCRNNGFFS YHMPNWFGTTKYVKPLVQKI^CTh2:QMK 

TCYNS VNI APL I DGSS S VGDSN7RL^CiEFVSNIAKTFE I SD IGAKIAAVQFTYDQRTEFS 

FTDYSTKEmn^AVIRNIRYMSGGTATGDAISFTVRNVFGPIRSSPNKNFLVIVTDGQSYD 

D VQGP^J^-KDAG I TI FS VGVAWAJ=LDDLKDMASKPKESHAFFTRSFTGLEP I VSDVI RG 

ICRDFLESQQ 
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FIGURE 83 

CGCCGCGCTCCCGCACCCGCGGCCCGCCCACCGCGCCGCTCCCGCATCTGCACCCGC;^GC 
CCGGCGGCCTCCCGGCGGGAGCGAGGAGATCCAGTCCGGCCCGCAGCGCA;iC'^CGGTCC;i 
GTCGGC-GCGGCGGCTGCGGGCGCAGAGCGGAG 
xMET ,{trans = l-s, dir=f , res=l} 

ATGCAGCGGCTTGGC-GCCACCCTGCTGTGCCTGCTGCTGGCGGCGGCGG'^CCCCACGGCC 
CCCGCGCCCGCTCCGACGGCGACCTCGGCTCCAGTCAAGCCCGGCCCGGCTCTCAGCTAC 
CCGCAGGAGGAGGCCACCCTCAATGAQATGTTCCGCGAGGTTGAGGAACTGATGGAGGAC 
ACGCAGCACAAATTGCGCAGCGCGGTGGAAGAGATGGAGGCAGAAGAAGCTGCTGCTAA^. 
GCATCATCAGAA.GTGAACCTGGCAAACTTACCTCCCAGCTATCACAATGAGACCAACACA 
GACACGAAGGTTGGAAATAATACCATCCATGTGCACCGAGAAATTCACAAGATAACCAAC 
AACCAGACTGGACAAATGGTCTTTTCAGAGACAGTTATCACATCTGTGGGAGACGAAGAA 
GGCAGAAGGAGCCACGAGTGCATCATCGACGAGGACTGTGGGCCCAGCA'-GTACTGCCAG 
TTTGCCAGCTTCCAGTACACCTGCCAGCCATGCCGGGGCCAGAGGATGCTCTGCACCCGG 
GACAGTGAGTGCTGTC<5AGACCAGCTGTGTGTCTGGGGTCACTGCACCAAAATGGCCACC 
AGGGGCAGCAATGGGACCATCTGTGACAACCAGAGGGACTGCCAGCCGGGGCTGTGCTGT 
GCCTTCCAGAGAGGCCTGCTGTTCCCTGTGTGCACACCCCTGCCCGTGGAGGGCGAGCTT 
TGCCATGACCCCGCCAGCCGGCTTCTGGACCTCATCACCTGGGAGCTAGAGCCTGATGGA 




CCCAGAGAGGTCCCCGATGAGTATGAAGTTGGCAGCTTCATGGAGGAGGT'GCGCCAGGAG 

CTGGAGGACCTGGAGAGGAGCCTGACTGAAGAGATGGCGCTGGGGGAGCCTGCGGCTGCC 

GCCGCTGCACTGCTGGGAGGGGAAGAGATTTAGATCTGGACCAGGCTGTGGGTAGATGTG 

CAATAGAAATAGCTAATTTATTTCCCCAGGTGTGTGCTTTAGGCGTGGGCTGACCAGGCT 
TCTTCCTACATCTTCTTCCCAGTaAr^TTTr'r^nn'rr-^r.r.^-.mrnr^T.^*r^r,,rT,«*««™«,«« 




AGGCAGGGTTAAACTGCAGGAGCAGTTTGCCACCCCTGTCCAGATTATTGGCTGCTTTGC 
CTCTACCAGTTGGCAGACAGCCGTTTGTTCTACATGGCTTTGATAATTGTTTGAGGGGAG 
GAGATGGAAACAATGTGGAGTCTCCCTCTGATTGGTTTTGGGGAAATGTGGAGAAGAGTG 
CCCTGCTTTGCAAACATCAACCTGGCAAAAATGCAACAAATGAATTTTCCACGCAGTTCT 
TTCCATGGGCATAGGTAAGCTGTGCCTTCAGCTGTTGCAGATGAAATGTTCTGTTCACCC 

AGCATTTTCATATCCAAGATCAATTCCCTCTCTCAGCACAGCCTGGGGAGGGGGTCATTG 
TTCTCCTCGTCCATCAC-GGATCTCAGAGGCTCAGAGACTGCAAGCTGCTTGCCCAAGTCA 
CACAGCTAGTGAAGACCAGAGCAGTTTCATCTGGTTGTGACTCTAAGCTCAGTGCTCTCT 

?C'; ; ^-^^^^GCATGCACATCTGGAATTAAGGTCAAACTAATTCTCACAT 
AAAGTAAACTACTGTTAGGAACAGCAGTGTTCTCACAGTGTGGGGCAGCCGTCCTTCTAA 

tgaagacaatgatattgacactgtccctctttggcagttgcKttagtaac^ 

ATATGACTGAGCGTAGCATACAGGTTAACCTGCAGAAACAGTACTT^ 

cgaggattataaatgaaatttgcaaaatcacttagcagSactSI^ 

ACGTGGAGAAAATCAAACCGAGCAGGGCTGTGTGAAACATGGT?^ 

aacactgaactctacgccactccacaaatgatgtt-tcagctotcat^^ 

CATGTATTCATCCAGAGTTCTTAAAGTTTAAAGTTG.CACATGAT^^^ 
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FIGURE S4 

MQRLGATLLCLLLJi^Ji.VPTAFAFA?TATSAPVlC?G?ALSYF:;'EEATIJ^ZV 
TQHKLH S AVE EHE AEE AAAKA S S IJ;^^ F P S YHNE TI^.TC 

NCTGQ>IVrSETVITSVGDEEC-RRSHECIIDEDCG?SMYCCFA^FQYTCQ?CRGORMLCTR 
DSECCGDQLCVWGHCTKMATRGSNGTICDNORDCQPGLCCAJ'CRCLLFFVCTP^PVSGEL 
CKDPASRLLDL1TWZLEPDGALDRCPGA3GLLCQPKSHSLVYVCKFTFVGSRDQDGEILL 
PREVFDEYEVGSFKEEVRQELEDLERSLTEEMALGEPAAAAAALLGGEEI 
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FIGURE 85A 

;'Ji.QGAC-GCTGG3AG3AAAGAG3TAAGAAAGGTTAGAGAACCTACCTCACATCTCTCTGGGCT 
CAGAAGGACTCTGAAGATAACAATAATTTCAGCCCATCCACTCTCCTTCCCTCCCAAACACA 
CATGTGCATGTACACACACACATACACACACATACACCTTCCTCTCCTTCACTGAAGACTCA 
CAGTCACTCACTCTGTGAGCAGGTCATAGAAAAGGACACTAAAGCCTTAJ^GGACAGGCCTGG 
CCATTACCTCTGCAGCTCCTTTGGCTTGTTGAGTCAAAAAACATGGGAGGGGCCAGGCACGG 
TGACTCACACCTGTAATCCCAG'CATTTTGGGAGACCGAGGTGAGCAGATCACTTGAGGTCAG 
GAGTTCGAGACCAGCCTGGCCAACATGGAGAAACCCCCATCTCTACTAAAAATACAAAAATT 
AGCCAGGAGTGGTGGCAGGTGCCTGTAATCCCAGCTACTCAGGTGGCTGAGCCAGGAGAATC 
GCTTGAATCCAGGAGGCGGAGGATGCAGTCAGCTGAGTGCACCGCTGCACTCCAGCCTGGGT 
G ACAG AATG AG ACT CTGT CT CAAACAAACAAACACGGGAGGAGGGGTAGATACTGCTT CTCT 
GCAACCTCCTTAACTCTGCATCCTCTTCTTCCAGGGCTGCCCCTGATGGGGCCTGGCAATGA 
CTGAGCAGGCCCAGCCCCAGAGGACAAGGAAGAGAAGGCATATTGAGGAGGGCAAGAAGTGA 
CGCCCGGTGTAGAATGACTGCCCTGGGAGGGTGGTTCCTTGGGCCCTGGCAGGGTTGCTGAC 
C CTTAC C CTGCAAAACACAAAGAG CAGGACTCCAGACT CTCCTTGTGAATGGTCCC CTG CCC 
TGCAGCTCCACCATGAGGCTTCTCGTGGCCCCACTCTTGCTAGCTTGGGTGGCTGGTGCCAC 
TGCCACTGTGCCCGTGGTACCCTGGCATGTTCCCTGCCCCCCTCAGTGTGCCTGCCAGATCC 
GGCCCTGGTATACGCCCCGCTCGTCCTACCGCGAGGCTACCACTGTGGACTGCAATGACCTA 
TTCCTGACGGCAGTCCCCCCGGCACTCCCCGCAGGCACACAGACCCTGCTCCTGCAGAGCAA 
CAGCATTGTCCGTGTGGACCAGAGTGAGCTGGGCTACCTGGCCAATCTCACAGAGCTGGACC 
TGTCCCAGAACAGCTTTTCGGATGCCCGAGACTGTGATTTCCATGCCCTGCCCCAGCTGCTG 
AGCCTGCACCTAGAGGAGAACCAGCTGACCCGGCTGGAGGACCACAGCTTTGC^VGGGCTGGC 
CAGCCTACAGGAACTCTATCTCAACCACAACCAGCTCTACCGCATCGCCCCCAGGGCCTTTT 
CTGG C CT C AG CAACTT G CTG CGGCTG CACCTCAACTCCAACCTCCTGAGGGCCATTGACAGC 
CGCTGGTTTGAAATGCTGCCCAACTTGGAGATACTCATGATTGGCGGCAACAAGGTAGATGC 
CATCCTGGACATGAACTTCCGGCCCCTGGCCAACCTGCGTAGCCTGGTGCTAGCAGGCATGA 
ACCTGCGGGAGATCTCCGACTATGCCCTGGAGGGGCTGCAAAGCCTGGAGAGCCTCTCCTTC 
TATGACAACCAGCTGGCCCGGGTGCCCAGGCGGGCACTGGAACAGGTGCCCGGGCTCAAGTT 
CCTAGACCTCAACAAGAACCCGCTCCAGCGGGTAGGGCCGGGGGACTTTGCCAACATGCTGC 
ACCTTAAGGAGCTGGGACTGAACAACATGGAGGAGCTGGTCTCCATCGACAAGTTTGCCCTG 
GTGAACCTCCCCGAGCTGACCAAGCTGGACATCACCAATAACCCACGGCTGTCCTTCATCCA 
CCCCCGCGCCTTCCACCACCTGCCCCAGATGGAGACCCTCATGCTCAACAACAACGCTCTCA 
GTGCCTTGCACCAGCAGACGGTGGAGTCCCTGCCCAACCTGCAGGAGGTAGGTCTCCACGGC 
AACCCCATCCGCTGTGACTGTGTCATCCGCTGGGCCAATGCCACGGGCACCCGTGTCCGCTT 
CATCGAGCCGCAATCCACCCTGTGTGCGGAGCCTCCGGACCTCCAGCGCCTCCCGGTCCGTG 
AGGTGCCCTTCCGGGAGATGACGGACCACTGTTTGCCCCTCATCTCCCCACGAAGCTTCCCC 
CCAAGCCTCCAGGTAGCCAGTGGAGAGAGCATGGTGCTGCATTGCCGGGCACTGGCCGAACC 
CGAACCCGAGATCTACTGGGTCACTCCAGCTGGGCTTCGACTGACACCTGCCCATGCAGGCA 
GGAGGTACCGGGTGTACCCCGAGGGGACCCTGGAGCTGCGGAGGGTGACAGCAGAAGAGGCA 
GGG CTATACAC CTGTGTGGC CCAGAACCTGGTGGGGGCTGACACTAAGACGGTTAGTGTGGT 
TGTGGGCCGTGCTCTCCTCCAGCCAGGCAGGGACGAAGGACAGGGGCTGGAGCTCCGGGTGC 
AGGAGACCCACCCCTATCACATCCTGCTATCTTGGGTCACCCCACCCAACACAGTGTCCACC 
AACCTCACCTGGTCCAGTGCCTCCTCCCTCCGGGGCCAGGGGGCCACAGCTCTGGCCCGCCT 
GCCTCGGGGAACCCACAGCTACAACATTACCCGCCTCCTTCAGGCCACGGAGTACTGGGCCT 
GCCTGCAAGTGGCCTTTGCTGATGCCCACACCCAGTTGGCTTGTGTATGGGCCAGGACCAAA 
GAGGCCACTTCTTGCCACAGAGCCTTAGGGGATCGTCCTGGGCTCATTGCCATCCTGGCTCT 
CGCTGTCCTTCTCCTGGCAGCTGGGCTAGCGGCCCACCTTGGCACAGGCCAACCCAGGAAGG 
GTGTGGGTGGGAGGCGGCCTCTCCCTCCAGCCTGGGCTTTCTGGGGCTGGAGTGCCCCTTCT 
GTCCGGGTTGTGTCTGCTCCCCTCGTCCTGCCCTGGAATCCAGGGAGGAAGCTGCCCAGATC 
CTCAGAAGGGGAGACACTGTTGCCACCATTGTCTCAAAATTCTTGAAGCTCAGCCTGTTCTC 
AGCAGTAGAGAAATC^CTAGGACTACTTTTTACCAAAAGAGAAGCAGTCTGGGCCAGATGCC 
CTGCCAGGAAAGGGACATGGACCCACGTGCTTGAGGCCTGGCAGCTGGGCCAAGACAGATGG 
GGCTTTGTGGCCCTGGGGGTGCTTCTGCAGCCTTGAAAAAGTTGCCCTTACCTCCTAGGGTC 
ACCTCTGCTGCCATTCTGAGGAACATCTCCAAGGAACAGGAGGGACTTTGGCTAGAGCCTCC 
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FIGURE R^K 

TGCCTCCCCATCTTCTCTCiGCCCAGAC-GCTCCTGGGCCTGGCTTGGCTGTCCCCTACCTGT 
GTCCCCGGGCTGCACCCCTTCCTCTTCTCTTTCTCTGTACAGTCTCAGTTGCTTGCTCTTGT 
GCCTCCTGGGCAAGGGCTGAAGGAGGCCACTCCATCrCACCTCGGGGGGCTGCCCTCAATGT 
GGGAGTGACCCCAGCCAGATCTGAAGGACATTTGGGAGAGGGATGCCCAGGAACGCCTCATC 
TGAGCAGCCTGGGCTCGGCATTCCGAAGCTGAGTTTCTATAGGCAATTTTGTACCTTTGTGG 
AG AAAT G T GT C AC CT C C C C C AAC CCG ATT CACT CTTTTCTCOTGTTTTGTAAAAAATAAAAA 
TAAATAATAACAi^.TAAAAAAA 
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FIGURE 86 

r^.LLVAPLLL-VA^AC-ATATVPWPVrHVPCFPQCACQIK?vrfT?RS£YREATT^/rCNDLF 

lta.vppalpagtqtlllqsnsivrvdqselgyla^xth:ldlsqnsfsdardcdfhalpq 

LLSLHLEENQLTRLEDHSFAGIJ;SLQELYl^'£NQLYHIAPRAFSGLSNLLRLHLNSN^ 

PA I D 5 R WFEML P^^: E I LM I GGNKVD AI IJ5MNFRP IJU^LRS LVTAG>^RE I S DY 

QSLESLSFYDNQIJU'.VPRRALEOVPGLKFI^UOTfPLQRVGPGDFANMI^KELGL^^ 

EELVSIDKFALVKL■FELTKLDITNNPRLSFIHPRAF^^a.PQMETIJ^i^mA^ 

VESLPNLQEVGLHGNFIRCDCVIRWANATGTRVRFIEPQSTLCAEFPDLQRLPVREVPF 

RFMTDHCL PL I S FRSF PPSLQVASGES>TVT^HCRALlASPEPE I YWVTPAGLRLTPAHAGR 

RYR\TFEGTLSLRRVTAEEAGLYTCVAQNLVGAI3TKTVSVVVGRALLQFGRDEGQGLEL 

P/\;0ETHPYKILLSI'A^TPPNTVSTNLTW3SASSIJIGQGATALARLPRGTHSYNITRLLQA 

TE YWACLQVAF AD AHTQ U^CWART KEATSOIRAU^DRPGLI AI IJUoAVLLI^ 

LGTGQPRKGVGGRRFLPPAWAFWGWSAP3VRWSAPLVLPWNPGRKLPRSSEGETLLPP 
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FICtURE 87A 

GCAAGCCAAGGCGCTGTTTGAGAAGGTGAAGAAGTTCCGGACCCATG-GGAGGAGGGGGA 
CATTGTGTACCGCCTCTAC ' 

xMET {trans = l-s, dir=f, res=l}> 

ATGCGGCAGACCATCATCAAGGTGATCAAGTTCATCCTCZVTCATCTGCTACACCGT^TAC 
TACGTGCACAACATCAAGTTCGACGTGGACTGCACCGTGGACATTGAGAGC^T'GACGGGC 




. _ ™.w^^ ^vj"^GGATGCTA 

CGGCGCTCCCTCAAGAAGTACTCGTTTGAGTCGATCCGTGAGGAGAGCAGC'-'ACAGCGAC 
ATCCCCGACGTCAAGAACGACTTCGCCTTCATGCTGCACCTCATTGACCAATACGACCCG 
CTCTACTCCAAGCGCTTCGCCGTCTTCCTGTCGGAGGTGAGTGAGAACAAGCTGCGGCAG 

ctgaacctcaacaacgagtggacgctggacaagctccggcagcggctcaccaagaacgcg 
caggacaagctggagctgcacctgttcatgctcagtggcatccctgacactgtgrttgac 
ctggtggagctggaggtcctcaagctggagctgatccccgacgtgaccatcccgcccagc 
attgcccagctcacgggcctcaaggagctgtggctctaccacacagcggccaagattgaa 
gcgcctgcgctcgccttcctgcgcgagaacctgcgggcgctgo^catcaagxtcaccgac 
atcaaggagatcccgctgtggatctatagcctgaagacactggaggagctgcacctgacg 
ggcaacctgagcgcggagaacaaccgctacatcgtcatcgacgggctgcgggagctcaaa 

CGCCTCAAGGTGCTGCGGCTCAAGAGCAACCTAAGCAAGCTGCCACAGGTGGTCACAGAT 

gtgggcgtgcacctgcagaagctgtccatcaacaatgagggcaccaagctca-^cgtcctc 
aacagcctcaagaagatggcgaacctgactgagctggagctgatccgctgcgacctggag 
cgcatcccccactccatcttcagcctccacaacctgcaggagattgacctcaaggacaac 
aacctcaagaccatcgaggagatcatcagcttccagcacctgcaccgcctcacctgcctt 
aagctgtggtacaaccacatcgcctacatccccatccagatcggcaacctcaccaacctg 
gagcgc ct ctac ctgaac cg caacaagatcgagaagatc cc ca g ccagctcttctactgc 
cgcaagctgcgctacctggacctcagccacaacaacctgaccttcctccctgccgacatc 

GGCCTCCTGCAGAACCTCCAGAACCTAGCCATCACGGCCAACCGGATCGAGACGCTCCC" 

ccggagctcttccagtgccggaagctgcgggccctgcacctgggcaacaacgtgctgcag 
tcactgccctccagggtgggcgagctgaccaacctgacgcagatcgagctgcggggcaac 
cggctggagtgcctgcctgtggagctgggcgagtgcccactgctcaagcgcagcggcttg 
gtggtggaggaggacctgttcaacacactgccacccgaggtgaaggagcggctgtggagg 
gctgacaaggagcaggcctgagcgaggccggcccagcacagcaagcagcaggaccgctgc 
ccagtcctcaggcccggaggggcaggcctagcttctcccagaactcccggacagccagga 
cagcctcgcggctgggcaggagcctggggccgcttgtgagtcaggccagagcgagaggac 
agtatctgtggggctggccccttttctccctctgagactcacgtccccaagggcaagtgc 
ttgtggaggagagcaagtctcaagagcgcagtatttggataatcagggtctcctccctgg 
aggccagctctgccccaggggctgagctgccaccagaggtcctgggaccctcacrrtivgt 
tcttggtatttatttttctccatctcccacctccttcatccagataacttatacattccc 
aagaaagttcagcccagatggaaggtgttcagggaaaggtgggctgccttttccccttgt 
ccttatttagcgatgccgccgggcatttaacacccacctggacttcagcagagtggtccg 
gggcgaaccagccatgggacggtcacccagcagtgccgggctgggctctgcggtgcggtc 
cacgggagagcaggcctccagctggaaaggccaggcctggagcttgcctcttcagttttt 
gtggcagttttagttttttgtttttttttrrrttaatcaaaaaacaattttttttaaaaa 
aaagctttgaaaatggatggtttgggtattaaaaagaaaajwjiaacttaaaaaaaaaaa 
gacactaacggccagtgagttggagtctcagggcagggtggcagtttcccttgagcaaag 
cagccagacgttgaactgtgtttcctttccctgggcgcagggtgcagggtgtcttccgga 
tctggtgtgaccttggtccaggagttctatttgttcc-ggggagggaggtttttttgttt 
gttttttgggtttttttggtgtcttgttttctt7ctcctccatgtgtcttggcaggcact 
catttctgtggctgtcggccagagggaatgttctggagctgccaaggagggaggagactc 
gggttggctaatccccggatgaacggtgctccattcgcacctcccctcctcgtgcctgcc 
ctgcctctccacgcacagtgttaaggagccaagaggagccacttcgcccagactttgttt 
ccccacctcctgcggcatgggtgtgtccagtgccaccgctc-gcctccgctgcttccatca 
gccc7gtcgccacctgg7ccttcatgaagagcagacacttaga.ggctggtcgggaatggg 
gaggtcgcccctgggagggcaggcgttggttccaagccggttcccgtccctggcgcctgg 



89 / 128 



wo 99/H328 



PCT/US98/19330 



FTGTJRE R7B 

AC-TC-CACACAC'CCCAGTCC-GCACCTGSTGGCTGGAAGCCAACCTC-CTTTAGATCACTCGG 
GTCCCCACCTTAGAJIGGGTCCCCGCCTTAGATCAATCACGTGGACACTAAGGCACGTTTT 
A.GAGTCTCTTGTCTTAATGATTATGTCCATCCGTCTGTCCGTCCATTTGTGTTTTCTGCG 
TCGTGTCATTGGATATAATCCTCAGAAATAATGGACACTAGCCTCTGACA-ACCA.TGAAGC 
AAJIAAT C CGTTAC ATGTGGG T CTGAACTTGT AGACTCGGT CACAGTATC AAATAAAAT CT 
ATAAC AG AAAAAAAJiJ^JiAAAA 
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FIGURE 88 

></usr/£eqdb2/£St/DN?i/:;naseq5 .air./ss .DN^JSGIB 

xsubur.it 1 ct 1, 546 aa, 0 szop 

><r-TK: 65742, pi: 8.62, NX (S/TJ : 6 

MRQT : IKVIK7ILI ICYT\nrArrC^ I KFDVDCTvT'IESLTGYRTYR 

Y I S LVI F YGL I CMYTIVTWMLRRSL KK Y S FH S I REE S S Y£ D I ?D VKNDFAFMLKL I DQ YD P 
L YS KRr AVr'LS E VSENKLRQLNIJ^EWTLDKIJIQRLTKNAQDK^ 

L^/ELEV"LF^EL I PD^vTTI PFS lAQLTGLKELWL Y?rrAAKI EAPAIAFLRENLRALHI KFTD 

IKEI?LKIYSLKTLEELHLTGNLSAENT^YIVIDGLRELKRIJCVLRLKS^^:.SKL?QVV^ 

VGV>:LC:<LSIK"TCEGTKLIVIJ^SLKKjviANLTSL^^ 

NLKT : EE 1 1 S FQHLHRLTCLKiWYNH I AY I P I Q I GNT^TNLERLYLNRl^I EK^ 
R:<LR"fLDLSH^^-FLPADIGLLQNLQl^ITANRIETLP?EIjFQCRKLRALHLGNNV^ 
SLPSRVGELTNLTQIELRGNRLECLFVELGECPLLKRSGLWEEDLFNTLPPEVKERLWR 
ADKSQA 
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FTGTIRF. SO 

GCCTGrTGCTGATGCTGCCGTGCGGTACTTGTC 
xMET {trans = l-s, dir=f, reE = l} 

ATGGAGCTGGCACTGCGGCGCTCTCCCGTCCCGCGGTGGTTGCTGCTGCTGCCGCTGCTG 
CTGGGCCTGAACGCAGGAGCTGTCATTGACTGGCCCACAGAGGAGGGCAAGGAAGTATGG 
GATTATGTGACGGTCCGCAAGGATGCCTACATGTTCTGGTGGCTCTATTATGCCACCAAC 
TCCTGCAAGAACTTCTCAGAACTGCCCCTGGTCATGTGGCTTCAGC-GCGGTCCAGGCGGT 
TGTAGCACTGGATTTGGAAACTTTGAGGAAATTGGGCCCCTTGACAGTGATCTCAAACCA 
CGGAAAAiCCACCTGGCTCCAGGCTGCCAGTCTCCTATTTGTGGATAATCCCGTGGGCACT 
GGGTTCAGTTATGTGAATGGTAGTGGTGCCTATGCCAAGGACCTGGCTATGGTGGCTTCA 
GACATGATGGTTCTCCTGAAGACCTTCTTCAGTTGCCACAAAGAATTCCAGACAGTTCCA 
TTCTACATTTTCTCAGAGTCCTATGGAGGAAAAATGGCAGCTGGCATTGGTCTAGAGCTT 
TATAAGGCCATTCAGCGAGGGACCATCAAGTGCAACTTTGCGGGGGTTGCCTTGGGTGAT 
TCCTGGATCTCCCCTGTTGATTCGGTGCTCTCCTGGGGACCTTACCTGTACAGCATGTCT 
CTTCTCGAAGACAAAGGTCTGGCAGAGGTGTCTAAGGTTGCAGAGCAAGTACTGAATGCC 
GTAAATAAGGGGCTCTACAGAGAGGCCACAGAGCTGTGGGGGAAAGCAGAAATGATCATT 
GAACAGAACACAGATGGGGTGAACTTCTATAACATCTTAACTAAAAGCACTCCCACGTCT 
ACAATGGAGTCGAGTCTAGAATTCACACAGAGCCACCTAGTTTGTC7TTGTCAGCGCCAC 
GTGAGACACCTACAACGAGATGCCTTAAGCCAGCTCATGAATGGCCCCATCAGAAAGAAG 
CTCAAAATTATTCCTGAGGATCAATCCTGGGGAGGCCAGGCTACCAACGTCTTTGTGAAC 
ATGGAGGAGGACTTCATGAAGCCAGTCATTAGCATTGTGGACGAGTTGCTGGAGGCAGGG 
ATCAACGTGACGGTGTATAATGGACAGCTGGATCTCATCGTAGATACCATGGGTCAGGAG 
GCCTGGGTGCGGAAACTGAAGTGGCCAGAACTGCCTAAATTCAGTCAGCTGAAGTGGAAG 
GCCC7GTACAGTGACCCTAAATCTTTGGAAACATCTGCTTTTGTCAAGTCCTACAAGAAC 
CTTGCTTTCTACTGGATTCTGAAAGCTGGTCATATGGTTCCTTCTGACCAAGGGGACATG 
GCTCTGAAGATGATGAGACTGGTGACTCAGCAAGAATAGGATGGATGGGGCTGGAGATGA 
GCTGGTTTGGCCTTGGGGCACAGAGGTGAGCTGAGGCCGCTGAAGCTGTAGGAAGCGCCA 
TTCTTCCCTGTATCTAACTGGGGCTGTGATCAAGAAGGTTCTGACCAGCTTCTGCAGAGG 

ata.w^tcattgtctctggaggc:aatttggaaattatttctgcttcttaaaaaaacctaa 

GATTTTTTAAAAAATTGATTTGTTTTGATCAAAATAAAGGATGATAATAGATATTAA 
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FTGIJRE 90 

>< signal peptjde> 

MEL.^RRSPVPRV^XLLLPLLLGLKA 
>< start mature protein> 
GAVIDTOTEEGKEW 
xhomoiogy to peptidases 40-end> 
DYVTVRKDAYMFWWLYYATNSCK 
Xpotentia) N-glycosylation site> 

NFSELPL\Tvm.QGGPGGSSTGFGNFEElGPlJ)SDLKPKKTTWLQAASlI^ 
GFSYV 

Xpotentia! N-glycosylation site> 

NGSGAYAKJ^LAMVASDMMVLLKTFFSCHKEFQTWFYIFSESYGGKiX^ 

KAIQRGTIKCNFAGVALGDSWISPVDSVLSWGPYLYSMSLLEDKGLAEVSKVAEQVL 

NAVNKGLYREATELWGKAEMirEQNTDGVNFmLTKSTPTSTMESSl^ 

CQRHVEiILQRDALSQLNlNGPIRiaa.KI]PEDQSWGGQAT>^ 

DELLEAGl 

xpotentiat N-glycosylation site> 

M^TVYNGQLDLIVDTMGQEA\V"VTUCLKWPELPKFSQIX\V^ 
YKNLAFY\mKAGHMWSDQGDMALKMMRLVTQQE 
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F I GTJR E 91 

GG CCGGGoGAGAGGfiGGCC 

xMET {trans=l-s, dir=f, res = i} 

ATGGGCGCGCGCGGGGCGCTGCTGCTGGCGCTGCTGCTGGGTCGGGCTGGACTCAGGAAG 
CGGGAGTCGCAGGAGGCGGCGCCGTTATCAGGACCATGCGGCCGACGGGTaATCACGTCG 
CGCATCGTGGGTGGAGAGGACGCCGAACTCGGGCGTTGGCCGTGGCAGGGGAGCCTGCGC 
CTGTGGGATTCCCACGTATGCGGAGTGAGCCTGCTCAGCCACCGC7GGGCACTCACGGCG 
GCGCACTGCTTTGAAACCTATAGTGACCTTAGTGATCCCTCCGGGTGGATGGTCCAGTTT 
GGCCAGCTGACTTCCATGCCATCCTTCTGGAGCCTGCAGGCCTACTACACCCGTTACTTC 
GTATCGAATATCTATCTGAGCCCTCGCTACCTGGGGAATTCACCCTATGAGATTGCCTTG 
GTGAAGCTGTCTGCACCTGTCACCTACACTAAACACATCCAGCCCATCTGTCTCCAGGCC 
TCCACATTTGAGTTTGAGAACCGGACAGACTGCTGGGTGACTGGCTGGGGGTACATCAAA 
GAGGATGAGGCACTGCCATCTCCCCACACCCTCCAGGAAGTTCAGGTCGCCATCATAAAC 
AACTCTATGTGCAACCACCTCTTCGTCAAGTACAGTTTCCGCAAGGACATCTTTGGAGAC 
ATGGTTTGTGCTGGCAACGCCCAAGGCGGGAAGGATGCCTGCTTCGGTGACTCAGGTGGA 
CCCTTGGCCTGTAACAAGAATGGACTGTGGTATCAGATTGGAGTCGTGAGCTGGGGAGTG 
GGCTGTGGTCGGCCCAATCGGCCCGGTGTCTACACCAATATCAGCCACCACTTTGAGTGG 
ATCCAGAAGCTGATGGCCCAGAGTGGCATGTCCCAGCCAGACCCCTCCTGGCCACTACTC 
TTTTTCCCTGTTCTCTGGGCTCTCCCACTCCTGGGGCCGGTCTGAGCCTACCTGAGCCCA 
TGCAGCCTGGGGCCACTGCCAAGTCAGGCCCTGGTTCTCTTCTGTCTTGTTTGGTAATAA 
ACACATT C CAGTTGATGCCTTGCAGGGCATTCTTCAAAAAAAAAAAAAAAAAAAAAAAAA 



9^ / 128 



wo 99/14328 



PCT/US98/19330 



FIGURE 92 

xsignaJ pepude> 
MGARGALLLALLU^R.A.GL 
xstan mature protein> 

RKPESQEA,^LSGPCGRRVITSRIVGGEDAELGRWWQGSDU.\VT)SHVCGVSLLSHRWA 
xSerine proteases, trypsin family, histidine active site 'LTAAHC'> 

LTA.-UiCFETYSDLSDPSGW\rv^QFGQLTSMPSFWSLQAYyTRY?VSMYLSPRyLGNS 

PYD1AL"VTCLSAP\TYTKHIQPICLQASTFEFE 

xpoientia] K-glycosylation site> 

NRTDCWVTGWGYHCEDEALPSPHTLQEVQVAn 

X potential N-glycosylation site> 

NNSMCNmFLKYSFRKDIFGDMVCAGNAQGGKI^ACFGDSGGPLACNKNGLWYQIG 

WSWGVGCGRPNRPGVYT 

xpotential N-glycosylation site> 

NISHHFEWIQKLMAQSGMSQPDPSWPLLFFPLLWALPLLGPV 



95 / 128 



wo 99/14328 



PCT/US98/19330 



C C C ACG CG T C CG CGGhC G CG TG G GAAGGG 'GAGA 
<KET {rrans^l-E, dir=f, res=l} 

ATGGGA.CTCCAJlGCGTGCCTCCTAGGGCTG7TrGCCCTCATCCTCTCTGGCAAA.TGCAGT 
TACAGCCCGGAGCCCGACCAGCGGAGGACGGTGCCCCCAGGCTGGGTGTCCCTC-GGCCGT 
GCGGACCCTGAGGAAGAGCTGAGTCTCACCTTTGCCCTGAGACAGCAGAATGTGGAAAGA 
CTCTCGGAGCTGGTGCAGGCTGTGTCGGATCCCAGCTCTCCTCAATACGGAAAATACCTG 
ACCCTAGAGAATGTGGCTGATCTGGTGAGGCCATCCCCACTGACCCTCCACACGGTGCAA 
AAA.TGGCTCTTGGCAGCCGGAGCCCAGAAGTGCCATTCTGTGATCACACAGGACTTTCTG 
ACTTGCTGGCTGAGCATCCGACAAGCAGAGCTGC7GCTCCCTGGGGCTGAGTTTCATCAC 
TATGTGGGAGGACCTA.CGGAAACCCATGTTGTAAGGTCCCCACATCCCTACCAGCTTCCA 
CAGGCCTTGGCCCCCCATGTGGACTTTGTGGGGGGACTGCACCGTTTTCCCCCAACATCA 
TCCCTGAGGCAACGTCCTGAGCCGCAGGTGACAGGGACTGTAGGCCTGCATCTGGGGGTA 
ACCCCCTCTGTGATCCGTAAGCGATACAACTTGACCTCACAAGACGTGGGCTCTGGCACC 
AGCAATA-ACAGCCAAGCCTGTGCCCAGTTCCTGGAGCAGTATTTCCATGACTCAGACCTG 
GCTCAGTTCATGCGCCTCTTCGGTGGCAACTTTGCACATCAGGCATCAGTAGCCCGTGTG 
GTTGGACA-ACAGGGCCGGGGCCGGGCCGGGATTGAGGCCAGTCTAGATGTGCAGTACCTG 
ATGAGTGCTGGTGCCAACATCTCCACCTGGGTCTACAGTAGCCCTGGCCGGCATGAGGGA 
CAGGAGCCCTTCCTGCAGTGGCTCATGCTGCTCAGTAATGAGTCAGCCCTGCCACATGTG 
CATACTGTGAGCTATGGAGATGATGAGGACTCCCTCAGCAGCGCCTACATCCAGCGGGTC 
AACACTGAGCTCATGAAGGCTGCCGCTCGGGGTCTCACCCTGCTCTTCGCCTCAGGTGAC 
AGTGGGGCCGGGTGTTGGTCTGTCTCTGGAAGACACCAGTTCCGCCCTACCTTCCCTGCC 
I'CCAGCCCCTA-TGTCACCACAGTGGGAGGCACATCCTTCCAGGAACCTTTCCTCATCACA 
AATGAAATTGTTGACTATATCAGTGGTGGTGGCTTCAGCAATGTGTTCCCACGGCCTTCA 
TACCAGGAGGAAGCTGTAACGAAGTTCCTGAGCTCTAGCCCCCACCTGCCACCATCCAGT 
TACTTCAATGCCAGTGGCCGTGCCTACCCAGATGTGGCTGCACTTTCTGATGGCTACTGG 
GTGGTCAGCAACAGAGTGCCCATTGCATGGGTGTCCGGAACCTCGGCCTCTACTCCAGTG 
TTTGGGGGGATCCTATCCTTGATCAATGAGCACAGGATCCTTAGTGGCCGCCCCCCTCTT 
GGCTTTCTCAACCCAJVGGCTCTACCAGCAGCATGGGGCAGGTCTCTTTGATGTAACCCGT 
GGCTGCCATGAGTCCTGTCTGGATGAAGAGGTAGAGGGCCAGGGTTTCTGCTCTGGTCCT 
GGCTGGGATCCTGTAACAGGCTGGGGAACACCAACTTCCCAGCTTTGCTGAAGACTCTAC 
TCAACCCCTGACCCTTTCCTATCAGGAGAGATGGCTTGTCCCCTGCCCTGAAGCTGGCAG 
TTCAGTCCCTTATTCTGCCCTGTTGGAAGCCCTGCTGAACCCTCAACTATTGACTGCTGC 
AGACAGCTTATCTCCCTAACCCTGAAATGCTGTGAGCTTGACTTGACTCCCAACCCTACC 
ATGCTCCATCATACTCAGGTCTCCCTACTCCTGCCTTAGATTCCTCAATAAGATGCTGTA 
ACTAGCATTTTTTGAATGCCTCTCCCTCCGCATCTCATCTTTCTCTTTTCAATCAGGCTT 
TTCCAAAGGGTTGTATACAGACTCTGTGCACTATTTCACTTGATATTCATTCCCCAATTC 
ACTGCAAGGAGACCTCTACTGTCACCGTTTACTCTTTCCTACCCTGACATCCAGAAACAA 
TGGCCTCCAGTGCATACTTCTCAATCTTTGCTTTATGGCCTTTCCATCATAGTTGCCCAC 
TCCCTCTCCTTACTTAGCTTCCAGGTCTTAACTTCTCTGACTACTCTTGTCTTCCTCTCT 
CATCAATTTCTGCTTCTTCATGGAATGCTGACCTTCATTGCTCCATTTGTAGATTTTTGC 
TC7TCTCAGTTTACTCATTGTCCCCTGGAACAAATCACTGACATCTACAACCATTACCAT 
CrCACTAAATAAGACTTTCTATCCAJi.TAATGATTGATACCTCAAATGTAAAAAA 
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xsigTiai peptids> 
MGLQACLLGLFALILS 
xstan mature protem> 

GKCSYSPEPDQRJlTLPPGWVSLGRADPEEELSLTFALRQQrA'^ERLSELVQAVSDPSSP 

QYGKYLTLENVADLVRPSPLTUiTVQKWIiAAGAQKCHSVnQDFLTCWLSmQ.AE^ 

LLPGAEFHHY\'GGPTETHVVRSPfffYQLPQALAPH\T>FVGGLHRFPPTSSLRQRPEPQ 

\TGTVGLHLGVTPSVIRKRY 

xpotential N-glycosylation site> 

KLTSQDVGSGTS 

xpoteniial N-glycosylation site> 

NNSQACAQFLEQYFHDSDLAQFMRLFGGNFAHQASVARWGQQGRGRAGEEASLDV 
QYLMSAGA 

xpoientia! N-glycosylation site> 

MSTWVYSSPGRHEGQEPFLQUXMLLS 
xpotential N-gJycosylation site> 

N^SALPHVHWSYGDDEDSLSSAYIQRVNTELMKAAARGLTLLFASGDSGAGCWSVS 

GRHQFRPTFPASSPYVTTVGGTSFQEPFLIT^^IVDY1SGGGFSNVFP^U'SYQEEAVW 

LSSSPHLPPSSYF 

Xpoteniial N-glycosylation site> 

NASGRA^'PDVAALSDGYM^VVSNRVPIPWVSGTSASTPVFGGILSLINEHRILSGRPPL 
GFLNPRLYQQHGAGLFD\TRGCHESCLDEE\'EGQGFCSGPGWIIPVTGWGTPTSQLC 
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FIGUR E 95 

GCCGCGCGCTCTCTCCCGGCGCCCACACCTGTCTGAGC&SCGCAGCGAGCCGCGGCCCC-G 

GCGGGCTGCTCGGCGCGGAACAGTGCTCGGC 

xMET (trans = l-s, dir=f, res=l)> 

ATGGCAGGC-ATTCCAGGGCTCCTCTTCCTTCTCTTCrrTCTGCTCTGTGCTGTTGGGCAA 
GTGAGCCCTTACAGTGCCCCCTGGAAACCCACTTGGCCTGCATACCGCCTCCCTGTCGTC 
■ TTGCCCCAGTCTACCCTCAATTTAGCCAAGGCAGACTTTGGAGCCGAAGCCAAATTAGAA 
GTATCTTCXTCATGTGGACCCCAGTGTCATAAGGGAACTCCACTGCCCACTTACGAAGAG 
QCCAAGCAATATCTGTCTTATGAAACGCTCTATGCCAATGGCAGCCGCACAGAGACGCAG 
GTGGGCATCTACATCCTCAGCAGTAGTGGAGATGGGGCCCAACACCGAGACTCAGGGTCT 
TCAGGAAAGTCTCGAAGGAAGCGGCAGATTTATGGCTATGACAGCAGGTTCAGCATTTTT 
GGGAAGGACTTCCTGCTCAACTACCCTTTCTCAACATCAGTGAAGTTATCCACGGGCTGC 
ACCGGCACCCTGGTGGCAGAGAAGCATGTCCTCACAGCTGCCCACTGCATACACGATGGA 
AJ^AACCTATGTGAAAGGAACCCAGAAGCTTCGAGTGGGCTTCCTAAAGCCCAAGTTTAAA 
G2.TGGTGGTCGAGGGGCCAACGACTCCACTTCAGCCATGCCCGAGCAGATGAAATTTCAG 
TGGATCCGGGTGAAACGCACCCATGTGCCCAAGGGTTGGATCAAGGGCAATGCCAATGAC 
ATCGGCATGGATTATGATTATGCCCTCCTGGAACTCAAAAAGCCCCACAAGAGAAAATTT 
ATGAAGATTGGC-GTGAGCCCTCCTGCTAAGCAGCTGCCAGGGGGCAGAATTCACTTCTCT 
GGTTATGACAATGACCGACCAGGCAATTTGGTGTATCGCTTCTGTGACGTCAAAGACGAG 
ACCTATGACTTGCTCTACCAGCAATGCGATGCCCAGCCAGGGGCCAGCGGGTCTGGGGTC 
TATGTGAGGATGTGGAAGAGACAGCAGCAGAAGTGGGAGCGAAAAATTATTGGCATTTTT 
TCAGGGCACCAGTGC-GTGGACATGAATGGTTCCCCACAGGATTTCAACGTGGCTGTCAGA 
ATCACTCCTCTCAAATATGCCCAGATTTGCTATTGGATTAAAGGAAACTACCTGGATTGT 
AGGGAGGC-GTGACACAGTGTTCCCTCCTGGCAGCAATTAAGGGTCTTCATGTTCTTATTT 
TAGGAGAGGCCAAATTGTTTTTTGTCATTGGCGTGCACACGTGTGTGTGTGTGTGTGTGT 
GTGTGTAAGGTGTCTTATAATCTTTTACCTATTTCTTACAATTGCAAGATGACTGGCTTT 
ACTATTTGAAAACTGGTTTGTGTATCATATCATATATCATTTAAGCAGTTTGAAGGCATA 
CTTTTGCATAGAAATAAAAAAAATACTGATTTGGGGCAATGAGGAATATTTGACAATTAA 
GTTAATCTTCACGTTTTTGCAAACTTTGATTTTTATTTCATCTGAACTTGTTTCAAAGAT 
TTATATTAAATATTTGGCATACAAGAGATATGAAAAAAAAAAAAAAA 
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FIGUR E % 

xsignal peptidO 
MAGIPGLLFLLFFLLCAVG 
xstan mature protein> 

QVSPYSAPWKPTWPAYKLPV-VXPQSTLl^AKPDFGAEAKLEVSSSCGPQCHKGTPL^ 

TYEEAKQYLSYETLYA 

xpotential N-glycosylation site> 

NGSRTETQVGIYILSSSGDGAQHRDSGSSGKSRRKRQIYGYDSRFSIFGKDFLLNYPFS 
TSVKLSTGCTGTLVAEKHV 

"Xserine proteases, trypsin family, histidine active site 'LTAAHC*> 
LTAAHCIHDGKrYVKGTQKLRVGFLKPKFKDGGRGA 
xpotential N-glycosylation siie> 

]ST)STSAMPEQMKFQWmVKRTHWKGWIKGNANDIG\ffiTO 

KJGVSPPAKQLPGGRIHFSGYDNDI^GNLVYRFCDVKDETYDLLYQQCDAQPGASGS 

GWVR^nVKJ^-QQQKTORKIIGIFSGHQWVDMNGSPQDFNVAVRnT 

GNYLDCREG 



99 / 128 



wo 99/14328 



PCT/US98/19330 



FIGURE 92 

GCATC3CCCTGGGTCTCTCGAGCCTGCTGCCTGCTCCCCCGCCCCACCAGCC 
xMET {trans = l-s, dir^f, res = l} 

ATGGTGGTTTCTGGAGCGCCCCCAGCCCTGGGTGGGGGCTG7CTCGGCACCTTCACCTCC 
CTGCTGCTGCTGGCGTCGACAGCCATCCTCAATGCGGCCAGGATACCTGTTCCCCCAGCC 
TGTGGGAAGCCCCAGCAGCTGAACCGGGTTGTGGGCGGCGAGGACAGCACTGACAGCGAG 
TGGCCCTGGATCGTGAGCATCCAGAAGAATGGGACCCACCACTGCGCAGGTTCTCTGCTC 
ACCAGCCGCTGGGTGATCACTGCTGCCCACTGTTTCAAGGACAACCTGAACAAACCATAC 
CTGTTCTCTGTGCTGCTGGGGGCCTGGCAGCTGGGGAACCCTGGCTCTCGGTCCCAOAAG 
GTGGGTGTTGCCTGGGTGGAGCCCCACCCTGTG7ATTCCTGGAAGGAAGGTGCCTGTGCA 
GACATTGCCCTGGTGCGTCTCGAGCGCTCCATACAGTTCTCAGAGCGGGTCCTGCCCATC 
TGGCTACCTGATGCCTCTATCCACCTCCCTCCAAACACCCACTGCTGGATCTCAGGCTGG 
GGGAGCATCCAAGATC-GAGTTCCCTTGCCCCACCCTCAGACCCTGCAGAAGCTGAAGGTT 
CCTATCATCGACTCGGAAGTCTGCAGCCATCTGTACTGGCGGGGAGCAGGACAGGGACCC 
ATCACTGAGGACATGCTGTGTGCCGGCTACTTGGAGGGGGAGCGGGATGCTTGTCTGGGC 
GACTCCGGGGGCCCCCTCATGTGCCAGGTGGACGGCGCCTGGCTGCTGGCCGGCATCATC 
AGCTGGGGCGAGGGCTGTGCCGAGCGCAACAGGCCCGGGGTCTACATCAGCCTCTCTGCG 
CACCGCTCCTGGGTGGAGAAGATCGTGCAAGGGGTGCAGCTCCGCGGGCGCGCTCAGGGG 
GGTGGGGCCCTCAGGGCACCGAGCCAGGGCTCTGGGGCCGCCGCGCGCTCCTAGGGCGCA 
GCGGGACGCGGGGCTCGGATCTGAAAGGCGGCCAGATCCACATCTGGATCTGGATCTGCG 
GCGGCCTCGGGCGGTTTCCCCCGCCGTAAATAGGCTCATCTACCTCTACCTCTGGGGGCC 
CGGACGGCTGCTGCGGAJ^GGAAACCCCCTCCCCGACCCGCCCGACGGCCTCAGGCCCCC 
CTCCAAGGCATCAGGCCCCGCCCAACGGCCTCATG7CCCCGCCCCCACGACTTCCGGCCC 
CGCCCCCGGGCCCCAGCGCTTTTGTGTATATAAATGTTAATGATTTTTATAGGTATTTGT 

^^-ACCCTGCCCACATATCTTATTTATTCCTCCAATTTCAATAAATTATTTATTCTCCAAAA 

AAAAAA 
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FIGURE 98 

>< signal pepude> 

MV\^SG-APPALGGGCLGTFTSLLLLASTAILNA 
Xstart ma-ture pepude> 

ARIPVPPACGKPQQLNRWGGEDSTDSEWPV^IVSIQK 

xpotential N-g]ycosylation site> 

NGTHHCAGSLLTSRWV 

xSerme proteases active site ITAAHC> 

ITAAHCFKD>n.NTCPYLFSVLLGA\VQLGNTGSRSQKVGVAWVEPHPVYSWKEGACA 
DIALVRLERSIQFSERVLPICLPDASIHLPPNTHCWISGWGSIQDGVPLPHPQTLQKLKV 
PIIDSEVCSHL^-WRGAGQGPITEDMLCAGYLEGERDACLGDSGGPLMCQVDGAWLL 
AGIISWGEGCAERNRPGVTISLSAHRSWVEKIVQGVQLRGRAQGGOALRAPSQGSGA 
AARS 
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FTGTITtF m 



><M£T trar.5=l-S/ dir=f, res=l} 

ATGCACGGCTCCTGCAGTTTCCTGATGCTTCTGCTGCCGCTACTGCTACTGCrGGTGGCC 
ACCACAGGCCCCGTTGGAGCCCTCACAGATGAGGAGAAACGTTTGATGGTGGAGCTGCAC 
A^.CCTCTACCGGGCCCAGGTATCCCCGACGGCCTCAGACATGCTGCACATGAGATGGGAC 
GAGGAGCTGGCCGCCTTCGCCAAGGCCTACGCACGGCAGTGCGiGTGGGGCCACAACA^.G 
GAGCGCGGGCGCCGCGGCaAGAATCTGTTCGCCATCACAGACGAGGGCATGGACGTGCCG 
CTGGCCATGGAGGAGTGGCACCACGAGCGTGAGCACTACAACCTCAGCGCCGCCACCTGC 
AGCCCAGGCCAGATGTGCGGCCACTACACGCAGGTGGTATGGGCCAAGACAGAGAGGATC 
GGCTGTGGTTCCCACTTCTGTGAGAAGCTCCAGGGTGTTGAGGAGACCAACATCGAATTA 
CTGGTGTGCAACTATGAGCCTCCGGGGAACGTGAAGGGGAAACGGCCCTACCAGGAGGGG 
ACiCCGTGCTCCCAATGTCCCTCTGGCTACCACTGCAAGAACTCCCTCTGTGAACCCATC 
GG.^J'.GCCCGGAAGATGCTCAGGATTTGCCTTACCTGGTAACTGAGGCCCCATCCTTCCGG 
GCGACTGAAGCATCAGACTCTAGGAAAATGGGTACTCCTTCTTCCCTAGCAACGGGGATT 
CCGGCTTTCTTGGTAACAGAGGTCTCAGGCTCCCTGGCAACCAAGGCTCTGCCTGCTGTG 
GAAACC CAGGC CCCAACTTCC T TAGCAACGAAAGACCCGCCCTCCATGGCAACAGAGGC T 
CCACCTTGCGTAACAACTGAGGTCCCTTCCATTTTGGCAGCTCACAGCCTGCCCTCCTTG 
GATGAGGAGCCAGTTACCTTCCCCAAATCGACCCATGTTCCTATCCCAAAATCAGCAGAC 
AAAGTGACAGACAAAACAAAAGTGCCCTCTAGGAGCCCAGAGAACTCTCTGGACCCCAAG 
A^CrCCCTGACAGGGGCAAGGGAACTCCTACCCCATGCCCAGGAGGAGGCTGAGGCTGAG 
GCTGAGTTGCCTCCTTCCAGTGAGGTCTrGGCCTCAGTTTTTCCAGCCCAGGACAAGCCA 
GGTGAGCTGCAGGCCACACTGGACCACACGGGGCACACCTCCTCCAAGTCCCTGCCCAAT 
TTCCCCAATACCTCTGCCACCGCTAATGCCACGGGTGGGCGTGCCCTGGCTCTGCAGTCG 
TCCTTGCCAGGTGCAGAGGGCCCTGACAAGCCTAGCGTTGTGTCAGGGCTGAACTCGGGC 
CCTGGTCATGTGTGGGGCCCTCTCCTGGGACTACTGCTCCTGCCTCCTCTGGTGTTGGCT 
GGA^i-TCTTCTGAATGGGATACCAGTCAAAGGGTGAAGAGGTCAGCTGTCCTCCTGTCATC 
TTCCCCACCCTGTCCCCAGCCCCTAAACAAGATACTTCTTGGTTAAGGCCCTCCGGAAGG 
GA^iJ^GGCTACGGGGCATGTGCCTCATCACACCATCCATCCTGGAGGCACAAGGCCTGGCT 
GGCTGCGAGCTCAGGAGGCCGCCTGAGGACTGCACACCGGGCCCACACCTCTCCTGCCCC 
TCCCTCCTGAGTCCTGGGGGTGGGAGGATTTGAGGGAGCTCACTGCCTACCTGGCCTGGG 
GCTGTCTGCCCAC^.CAGCATGTGCGCTCTCCCTGAGTGCCTGTGTAGCTGGGGATGGGGA 
XrCCTAGGGGCAGATGAAGGACAAGCCCCACTGGAGTGGGGTTCTTTGAGTGGGGGAGGC 
AGGGACa^GGGAAGGAAAGTAACTCCTGACTCTCCAATAAAAACCTGTCCAACCTGTGA;!. 
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FTCxTIRF inn 

xsignal peptide> 

MHGSCSFL-MLLLPLLLLLVATT 

xstan of mature peptide, extracellular domain> 

GPVGALTDEEKRLMVELHNXYRAQVSPTASDMLHMRWDEEL^.^KAYARQCW 

GHNTCERGRRGENLFAITDEGMDVPLAMEE\VlfflEREHY 

Xpotential N-glycosylation site> 

NXSA-ATCSPGQMC 

><GHyTQ\^^AKT ExtraceUular proteins SCP/Tpx-l/Ag5/PR-l/Sc7 signature 1 - CRISP 
signature> 

GHYTQWWAKTERIGCGSHFCEKLQGVEETNIEL 

xLLVCNYEPPGHV Exuacelblar proteins SCP/Tpx-l/Ag5/PR-l/Sc7 signature 2 -CRISP 

sigriamre> 

LVC^TEPPGNVKGKRPYQEGTPCSQCPSGYHCKNSLCEPIGSPEDAQDIPYLVTEAPSFR 
ATEASDSRK^IGTPSSLATGPAFL\TEVSGSIATKAIJ'AVETQAPTSLATKDPPSMATEA 
PPCVTTEVTSrLAAHSLPSLDEEPVTFPKSTHVTIPKSADKVTDKTKVPSRSPENSLDPK 
MSLTGARHiPRAQEEAEAEAELPPSSEVIJ^VFPAQDKPGELQATmHTGHTSSKSLPN 
FP 

xpotential N-glycosyiation srte> 
NTSATA 

xpotential N-glycosylation site> 
NATGGRALALQSSLPGAEGPDKPSWSGLNS 
Xpotential glycosylphosphatidylinositoi attachment site> 
GPGH\n.VGPLLGLLLLPPLVLAGIF 
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FIGURE 101 A 

gtaactgaagtcaggcttttcatttggga.2.gccccctcaacag;^attcggtcattctcc;^ 

AGTT 

xMET {trar-s = l-s, dir=f, res = l) 

ATGGTGGACGTACTTCTGTTGTTCTCCCTCTGCTTGCTTTTTCACATTAGCAGACCGGAC 
TTAAGTCACA-ACAGATTATCTTTCATCAAGGCAAGTTCCATGAGCCACCTTCAAAGCCTT 
CGAGAAGTGA^CTGAJVCAACAATGAATTGGAGACCATTCCAAATCTGGGACCAGTCTCG 
GCAA-ATATTACACTTCTCTCCTTGGCTGGAAACAGGATTGTTGAAATACTCCCTGAACAT 
CTGAA-AGAGTTTCAGTCCCTTGAAACTTTGGACCTTAGCAGCAACAATATTTCAGAGCTC 
CAAJlCTGCATTTCCAGCCCTACAGCTCAAATATCTGTATCTCAACAGCAACCGAGTCACA 
TCAATGGAACCTGGGTATTTTGACAATTTGGCCAACACACTCCTTGTGTTAAAGCTGAAC 
AGGAACCC-AJITCTCAGCTATCCCACCCAAGATGTTTAAACTGCCCCAACTGCAACATCTC 
GAATTGAACCGAAACAAGATTAAAAATGTAGATGGACTGACATTCCAAGGCCTTGGTGCT 
CTGAAGTv- ^ CTGAAAATGCAJIAGAAATGGAGTAACGAAACTTATGGATGGAGCTTTTTGG 
C^-GCTGAGCAJi.CATGGAAATTTTGCAGCTGGACCATAACAACCTAACAGAGATTACCAAA 
GGCTC<;CTTTACGGCTTGCTGATGCTGCAGGAACTTCATCTCAGCCAAAATGCCATCAAC 
AGGATCAGCCCTGATGCCTGGGAGTTCTGCCAGAAGCTCAGTGAGCTGGACCTAACTTTC 
AATCACTTATCAJ^GGTTAGATGATTCAAGGTTCCTTGGCCTAAGCTTACTAAATACACTG 
CACATTGGGAACAACAGAGTCAGCTACATTGCTGATTGTGCCTTCCGGGGGCTTTCCAGT 
TT;iJ^X-ACTTrGGATCTGAAGAACAATGAAATTTCCTGGACTATTGAAGACATGAATGGT 
GCTTTCTCTGGGCTTGACAAACTGAGGCGACTGATACTCCAAGGAAATCGGATCCGTTCT 
ATrr^^CTAAAAAAGCCTTCACTGGTTTGGATGCATTGGAGCATCTAGACCTGAGTGACAAC 




. ^^.^^^..x i. i i^iv^v_oAi itJLL:AUL-TAAAATGGCTCCCACAGTGGGTGGCG 

GAAAACAACTTTCAGAGCTTTGTAAATGCCAGTTGTGCCCATCCTCAGCTGCTAAAAGGA 
AGAAGCATTT7TGCTGTTAGCCCAGATGGCTTTGTGTGTGATGATTTTCCCAAACCCCAG 
ATCACGGTTCAGCCAGAAACACAGTCGGCAATAAAAGGTTCCAATTTGAGTTTCATCTGC 
TCAGCTGCCAGCAGCAGTGATTCCCCAATGACTTTTGCTTGGAAAAAAGACAATGAACTA 




TGTGTCATCTCCAATCACTTTGGTTCATCCTACTCTGTCAAAGCCAAGCTTACAGTAAAT 

ATGCTTCCCTCATTCACCAAGACCCCCATGGATCTCACCATCCGAGCTGGGGCCATGGCA 

CGCTTGGAGTGTGCTGCTGTGGGGCACCCAGCCCCCCAGATAGCCTGGCAGAAGGATGGG 
GGClA.C!AGArTTrr^ar:rTr:pa mrv^7^r■•^/-.•^ r,,^^ — ^ 




rii: "-----^-w^ ^--'^v^^ i«.va/i<^>^W\ i 1 LiLUJUTATACAGCTGCACAGCTCAGAAC 

^-SI5X^^^^^^^'^"^^^^'^GCAACTCTGACTGTCCTAGAAACACC7^TCATTm 
'^L^G^CACTGTTGGACCGAACTGTAACCAAGGGAGAAACAGCCGTCCTACAGTGCATTGCT 




;:; i;"^^"" " " " " ^^^^^^^^-^i'-AUL-rTCTGATTATTGTGGACrCAGATGTCAGT 

GATG^TGGGAAATACACATGTGAGATGTCTAACACCCTTGGCACTGAGAGAGGAAACGTG 

^^-^^S^^^^J^^'^^^^CTCCAACCTGCGACTCCCCTCAGATGACAGCCCC^^ 

GACGATGACGGATGGGCCACTGTGGGTGTCGTGATCATAGCCGTGGTTTGCTGTGTGGTG 

;';;^^^^?if^^^'^^^^^GTG^TCATCATATACCACACAA^ 

AGCATTACCAACACAGATGAGACCAACTTGCCAGCAGATATTCCTAGTTATTTGTCATCT 
;;?i:i!:;^S^y^^^^'^^^^^^^^GGATGGGTACGTGTCTTCAGAAAGTGGAAGCCACCAC 
CAGTTTGTCACATCTTCAGGTGCTGGATTTTTCTTACCACAACATGACAGTAGTGGGACC 
^^^S^^^!IS^^^^^°^GTGAAGCTGATGTGGAAGC^^ 

^-Lb X ^ TTi ^u-GATCCACAGGCCCTATGTATTTGAAGGGAAATGTGTATGGCTCAGATrCT 
TTpAAACATATCATACAGGTTGCAGTCCTGACCCAAGAACAGT^^^ 
GAoCCCAGTTACATAAAGAAAAAGGAGTGCTACCCATGTTCTCATCCTTCAGAAGAATCC 

rTAr.r-r2irL2.Tr,i.z.r.r_^r^^^GAATGAAAAATCTGTGTCTAAACAAGTCCTCTTTA 

3CGTCGGTTGCCTCGAGTAATTCTTTCATGGGTACC 
"ACCTAGATGCCTATTCAAGCTTTGGACAGCCATCA 
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FTGTIRF iniR 

GATTGTCAGCCAAGAGCCTTTTATiTGAA.^GCTCI^?TCTTCCCCAGACTTGGACTCTGGG 

T CAGAG GAAG AT GG GAAAGAAAGG ACAGAT TTTC^IGGAAGAAAATC ACAT TTGTACGTTT 

.^■AACAGAGTTTAGAAAACTACA.GGAGTCCAAATTTTCAGTCTTA-TGACTTGGACACATAG 

AGTGAATC-AGACCAAAGGAJiAAGCTTAACATACTACCTCAAGTGA-ACTTTTATTTAAAAG 

AGAGAGAATCTTATGTTTTTTAAATGGAGTTATGAATT7TAAAAGGATAAAAATGCTTTA 

TTTATACAGATGAAGCAJUiJi.TTACAJ:LAAAGTTATGAAAATTTTTATACTGGGAATG 

TCATATAAGAATACCTTTTTAAACTA.TTTTTTAACTTrGTTTTATGCAAAAAAGTATCTT 

ACGTAAATTAATGATATAAATCATGATTATTTTATGTATTTTTATAATGCCAGATTTCTT 

TTTATGGAAAATGAGTTACTAAAGCATTTTAAATAATACCTGCCTTGTACCATTTTTTAA 

ATAGAAGTTACTTCATTATATTTTGGACATTATATTTAATAAAJVTGTGTCAATTTGAA 
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FIGURF 102 

ANITLLSLAGr^-IVEILPEHLKEFQ3LETLDLSSNT\-ISELQTAr?ALQLKYLyLNSNRVT 
SMEPGYECNIJ^TLLVLKI^^^RISAIPPKMFKIjPQLCHLEUnRNKIK^ 

lkslk:^crngvtkij^dgafwglsnmeilqldhnklteitkgwlygllklqelhlsqnain 

RISFCAVTEFCQKLSSLDLTraHLSRLDDSSFLGLSLLNTLHIGNNRVSYIArCAFRGLSS 

LKTLDLKNNEISVTTIEDmGAFSGLDKIJ^ILQGNRIRSITK^JTGLDALEKLDLSDN 

AIMSLQGNAFSQMKKLQQLHLNTSSLLCDCQLKWLPQWAENNFQSFVNASCAHPQLLKG 

RSIFAVSPDGFVCDDrPKPQITVQPETQSAIKGSNLSFICSAA5£SD£PMTFAWKKDNEL 

LKDAEMENYAHLRAQGGEWEYTTIIJllJlEVEFASEGKTQCVIENHFGSSySVKA^ 

MLFS FTKTPMD LTI RAGAMARLECAAVGHPAPQ I AWQKDGGTDr PAARERRMHVMPEDDV 

FF IVDVKI ED IGVYSCTAQNSAG3 1 SANATLTVLETPSFLRPIiLDRTVTKGETAVLQC lA 

GGSPFFKLNWTKDDSFLVVTERHFFAAGNQLLIIVDSDVSDAGKYTCEMSNTLGTERGNV 

RLSVIPTPTCDSPOrOTAPSLDDDGWATVGWI lAWCCWGTSLWAATI lYHTRRRNEDC 

SITNTDETJTLPAIDIPSYLSSQGTLADRQDGYVSSESGSHHQFVTSSGAGFFLPQKDSSGT 

CK I DNS S EAD VEAATDLFLCPFLGSTG PMYLKGNVYGSDPFETYETGCS PDPRTVLMDHY 

E?SYIKiaCECyPCSHPSEESCERSFSKISWPSHVllia.LKTSYSHNEGPGM^ 

DFSANPEPASVASSNSFMGTFGKALRRPHLDAYSSFGQPSDCQPRAFYLKAHSSPDLDSG 

SZEDGKERTDFQSENHICTFKQTLENYRTPNFQSYDLDT 
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GGGGAGAGGAATTGACCATGTAAAii.GGAGACTTTTTTTTTTGGTGGTGGTGGCTGTTGGG 

TGCCTTGCAAAAATGAAGGATGCAGGA.CGCAGCTTTCTCCTGGAACCGAACGCAATGGAT 

AJiACTGATTGTG'CAAGAGAGAAGGAAGAACGAAGCTTTTTCTTGTGAGCCCTGGATCTTA 

ACAC^JIATGTGTATATGTGCACACAGGGA.GCATTCAAGAATGAAATAAACCAGAGTTAGA 

CCCGCGGGGGTTGGTGTGTTC7GA.CATAAATAAATAATCTTAAAGCAGCTGTTCCCCTCC 

CCACCCCCAAAAAAAAGGATGATTGGAAJLTGAAGAACCGAGGATTCACAAAGAAAAJiAGT 

ATGTTCATTT-TCTCTATAAAGGAGAAAGTGAGCCAAGGAGATATTTTTGGAATGAAAAG 

TTTGGGGCTTTTTTAGTAAAGTAAAGAACTGGTGTGGTGGTGTTTTCCTTTCTTTTTGAA 

TTTCCCACAAGAGGAGAGGAAATTAJITAATACATCTGCAAAGAAATTTCAGAGAAGAJIAA 

GTTGACCGCGGCAGATTGAGGCATTGATTGGGGGAGAGAAACCAGCAGAGCACAGTTGGA 

TTTGTGCCTATGTTGACTAAAATTGACGGATAATTGCAGTTGGATTTTTCTTCATCAACC 

TCCTTTTTTTTAAATTTTTATTCCTTTTGGTATCAAGATCATGCGTTTTCTCTTGTTCT^ 

AACCACCTGGATTTCCATCTGGATGTTGCTGTGATCAGTCTGAAATACAACTGTTTGAAT 

TCCAGAAGGACCAACACCAGATAA;i.TTATGA. 

xMZT (trans = l-s, dir=f, res = l} 

ATGTTGAACAAGATGACC7TACATCCACAGCAGATAATGATAGGTCCTAGGTTTAACAGG 
GCCCTATTTGACCCCCTGCTTGTGGTGCTGCTGGCTCTTCAACTTCTTGTGGTGGCTGGT 
CTGGTGCGGGCTCAGACCTGCCCTTCTGTGTGCTCCTGCAGCAACCAGTTCAGCAAGGTG 
ATTTGTGTTCGGAAAAACCTGCGTGAGGTTCCGGATGGCATCTCCACCAACACACGGCTG 
CTGAACCTCCATGAGAACCAAATCCAGA.TCATCAAAGTGAACAGCTTCAAGCACTTGAGG 
CTICTTGGAAAT C CTACAGTTGAGTAGGAACCATATCAGAACCATTGAAATTGGGGCTTTC 
AATGGTCTGGCGAACCTCAACACTCTGGAACTCTTTGACAATCGTCTTACTACCATCCCG 
AATGGAGCTTTTGTATACTTGTCTAAACTGAAGGAGCTCTGGTTGCGAAACAACCCCATT 
GAAAGCATCCCTTCTTATGCTTTTAACAGAATTCCTTCTTTGCGCCGACTAGACTTAGGG 
GAATTGAAAAGACTTTCATACATCTCAGAAGGTGCCTTTGAAGGTCTGTCCAACTTGAGG 
TATTTGAACCTTGCCATGTGCAACCTTCGGGAAATCCCTAACCTCACACCGCTCATAAAA 
CTAGATGAGCTGGATCTTTCTGGGAATCATTTATCTGCCATCAGGCCTGGCTCTTTCCAG 
GGTTTGATGCACCTTCAAAAACTGTGGATGATACAGTCCCAGATTCAAGTGATTGAACGG 
AATGCCTTTGACAACCTTCAGTCACTAGTGGAGATCAACCTGGCACACAATAATCTAACA 
TTACTGCCTCATGACCTCTTCACTCCCTTGCATCATCTAGAGCGGATACATTTACATCAC 
AACCCTTGGAACTGTAACTGTGACATACTGTGGCTCAGCTGGTGGATAAAAGACATGGCC 
CGCTCGAACACAGCTTGTTGTGCCCGGTGTAACACTCCTCCCAATCTAAAGGGGAGGTAC 
ATTGGAGAGCTCGACCAGAATTACTTCACATGCTATGCTCCGGTGATTGTGGAGCCCCCT 
GCAGACCTCAATGTCACTGAAGGCATGGGAGCTGAGCTGAAATGTCGGGCCTCCACATCC 
CTGACATCTGTATCTTGGATTACTCCAAATGGAACAGTCATGACACATGGGGCGTACAAA 
GTGCGGATAGCTGTGCTCAGTGATGGTACGTTAAATTTCACAAATGTAACTGTGCAAGAT 
ACAGGCATGTA.CACATGTATGGTGAGTAATTCCGTTGGGAATACTACTGCTTCAGCCACC 
CTGAATGTTACTGCAGCAACCACTACTCCTrTCTCTTACTTTTCAACCGTCACAGTAGAG 
ACTATGGAACCGTCTCAGGATGAGGCACGGACCACAGATAACAATGTGGGTCCCACTCCA 
GTGGTCGACTGGGAGACCACCAATGTGACCACCTCTCTCACACCACAGAGCACAAGGTCG 
ACAGAGAAAACCTTCA.CCATCCCAGTGACTGATATAAACAGTGGGATCCCAGGAATTGAT 
GAGGTCATGAAGACTACCAAAATCATCATTGGGTGTTTTGTGGCCATCACACTCATGGCT 
GCAGTGATGCTGGTCATTTTCTACAAGATGAGGAAGCAGCACCATCGGCAAAACCATCAC 
GCCCCAJICAAGGACTGTTGAAATTATTAATGTGGATGATGAGATTACGGGAGACACACCC 
ATGGAAAGCCACCTGCCCATGCCTGCTATCGAGCATGAGCACCTAAATCACTATAACTCA 
TACAAATCTCCCTTCAACCACACAACAACAGTTAACACAATAAATTCAATACACAGTTCA 
GTGCATGAACCGTTATTGATCCGAATGAACTCTAAAGACAATGTACAAGAGACTCAAATC 
TAAAACATTTACAGAGTTACAAAAAACAAACAATCAAAAAAAAAGACAGTTTATTAAAAA 
TGACACAAATGACTGGGCTAAATCTACTGTTTCAAAAAAGTGTCTTTACAAAAAAACAAA 
AAAGAAAAGAAATTTA7TTATTAAAAATTCTATTGTGATCTAAAGCAGACAAAAA 
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FTGTIRR104 

xsignal peptide> 

^^L^la^rLHPQQIMIGPRFNR.M-FDPLL\"Vl.L.M.QLLWAGL\'RA 
xstart mature peptide, extracellular domaiii> 
QTCPSVCSCSNQFSKVICVRK^^REVPDGISTNTRL 
Xstart leucine rich repeat domains> 

LNLHENQIQIIKWSFKHLRHLEILQLSR>mimiGAFNGLANLmi^ 
G.AJVYLSKLKELmRNNPIESIPSYAFNRIPSIJUUJDLGELKR^ 
LNLAMCNLR£PNLTPLIKU)ELDLSGNmSAIIU>GSFQGLMHLQKLWMIQS 
NAFDNLQSLVEINLAH^^eTLLPHDLFIPLHHLERIHU^H^ 

DMAPSNTACCARCKTPPNLKGRYIGEU>QNYFrCYAPVI\^PPADLNVTEGMAAELK 

CRASTSLTSVSWITPNGTVMTHGAYKVRIAVLSDGTLNF^mTVQDTG^^ 

S^^ONTTASATLNVTAATTTPFSYFSTVTVETMEPSQDEARTTDN^ 

><end leucine rich repeat domains> 

TTNVTTSLTPQSTRSTEKTFTIPVTDINSGPGIDE 

Xstart transmembrane domajn> 

VMKTTKIIIGCFVAJTLMAAVMLVI 

Xstart intraceiiulat domain> 

FYKMRKQHHRQNHRAPTRTVElINVDDErrGDTPMESHLPMPAE 
PFNHTTTVNTINSIHSSVHEPIXrRMNSKDNVQETQI 
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nOXJRE 105 A 

AGCCGACGCTGCTCAAGCTGCAACTCTGTTGCAC-TTGGCAGTTCTTTTCGGTTTCCCTCCTG 
CTGTTTGGGGGCATGAAAGGGCTTCGCCGCCGGGAGTAAAAGAAGGAATTGAGCGGGCAGCG 
CC-AGGGAGGAGCGCGCACGCGACCGCGAGGGCGGGCGTGCACCCTCGGCTGGAAGTTTGTGC 
CG GG CC.C CGAG CG CG CGCCGGCTGGGAGCTTCGGGTAGAGACCTAGGCCGCTGQACCGCQAT 
GAGCGCGCCGAGCCTCCGTGCGCGCGCCGCGGGGTTGGGGCTGCTGCTGTGCGCGGTGCTGG 
GGCGCGGTGGCCGGTCCGACAGCGGCGGTCGGGGGGAACTCGGGCAGCCCTCTGGGGTAGCC 
GCCGAGCGCCCATGCCCCACTACCTGCCGCTGCCTCGGGGACCTGCTGGACTGCAGTCGTAA 
GCGGCTAGCGCGTCTTCCCGAGCCACTCCCGTCCTGGGTCGCTCGGCTGGACTTAAGTCACA 
ACAGATTATCTTTCATCAAGGCAAGTTCCATGAGCCACCTTCAAAGCCTTCGAGAAGTGAAA 
CTGAACAACAATGAATTGGAGACCATTCCAAATCTGGGACCAGTCTCGGCAAATATTACACT 
T C T CT C CTT GG CTG G AAACAGG ATTGTTGAAATACT CCCTGAACATCTGAAAGAGTTTCAGT 
CCCTTGAAACTTTGGACCTTAGCAGCAACAATATTTCAGAGCTCCAAACTGCATTTCCAGCC 
CTACAGCTCAAATATCTGTATCTCAACAGCAACCGAGTCACATCAATGGAACCTGGGTATTT 
TGACAATTTGGCCAACACACTCCTTGTGTTAAAGCTGAACAGGAACCGAATCTCAGCTATCC 
CACCCAAGATGTTTAAACTGCCCCAACTGCAACATCTCGAATTGAACCGAAACAAGATTAAA 
AATGTAGATGGACTGACATTCCAAGGCCTTGGTGCTCTQAAGTCTCTGAAAATGCAAAGAAA 
TGGAGTAACGAAACTTATGGATGGAGCTTTTTGGGGGCTGAGCAACATGGAAATTTTGCAGC 
TGGACCATAACAACCTAACAGAGATTACCAAAGGCTGGCTTTACGGCTTGCTGATGCTGCAG 
G7iACTTCAT CTCAG C CAAAATGCCATCAACAGGATCAGCCCTGATGCCTGGGAGTTCTGCCA 
GAAGCTCAGTGAGCTGGACCTAAC'TTTCAATCACTTATCAAGGTTAGATGATTCAAGCTTCC 
T1.G Z CCTAAGCTTACTAAATACACTGCACATTGGGAACAACAGAGTCAGCTACATTGCTGAT 
TGTGCCTTCCGGGGGCTTTCCAGTTTAAAGACTTTGGATCTGAAGAACAATGAAATTTCCTG 
GACTATTGAAGACATGAATGGTGCTTTCTCTGGGCTTGACAAACTGAGGCGACTGATACTCC 
AAGGAAATCGGATCCGTTCTATTACTAAAAAAGCCTTCACTGGTTTGGATGCATTGGAGCAT 
CTAGACCTGAGTGACAACGCAATCATGTCTTTACAAGGCAATGCATTTTCACAAATGAAGAA 
ACTG CAA GAATTG CAT TTAAATACATCAAGCCTTTTGTGCGATTGCCAGCTAAAATGGCTCC 
CACAGTGGGTGGCGGAAAACAACTTTCAGAGCTTTGTAAATGCCAGTTGTGCCCATCCTCAG 
CTGCTAAAAGGAAGAAGCATTTTTGCTGTTAGCCCAGATGGCTTTGTGTGTGATGATTTTCC 
CAAACCCCAGATCACGGTTCAGCCAGAAACACAGTCGGCMTAAAAGGTTCCAATTTGAGTT 
TCATd^GCTCAGCTGC C AG CAG CAGTG ATT C C CCAATGACTTTTG CTTGGAAAAAAG ACAAT 
GAACTACTGCATGATGCTGAAATGGAAAATTATGCACACCTCCGGGCCCAAGGTGGCGAGGT 
GATGGAGTATACCACCATCCTTCGGCTGCGCGAGGTGGAATTTGCCAGTGAGGGGAAATATC 
AGTGTGTCATCTCCAATCACTTTGGTTCATCCTACTCTGTCAAAGCCAAGCTTACAGTAAAT 
ATGCTTCCCTCATTCACCAAGACCCCCATGGATCTCACCATCCGAGCTGGGGCCATGGCACG 
CTTGGAGTGTGCTGCTGTGGGGCACCCAGCCCCCCAGATAGCCTGGCAGAAGGATGGGGGCA 
CAGACTTCCCAGCTGCACGGGAGAGACGCATGCATGTGATGCCCGAGGATGACGTGTTCTTT 
ATCGTGGATGTGAAGATAGAGGACATTGGGGTATACAGCTGCACAGCTCAGAACAGTGCAGG 
AAGTATTTCAGCAAATGCAACTCTGACTGTCCTAGAAACACCATCATTTTTGCGGCCACTGT 
TGGACCGAACTGTAACCAAGGGAGAAACAGCCGTCCTACAGTGCATTGCTGGAGGAAGCCCT 
CCCCCTAAACTGAACTGGACCAAAGATGATAGCCCATTGGTGGTAACCGAGAGGCACTTrTT 
TGCAGCAGGCAATCAGCTTCTGATTATTGTGGACTCAGATGTCAGTGATGCTGGGAAATACA 
CATGTGAGATGTCTAACACCCTTGGCACTGAGAGAGGAAACGTGCGCCTCAGTGTGATCCCC 
ACTCCAACCTGCGACTCCCCTCAGATGACAGCCCCATCGTTAGACGATGACGGATGGGCCAC 
TGTGGGTGTCGTGATCATAGCCGTGGTTTGCTGTGTGGTGGGCACGTCACTCGTGTGGGTGG 
TCATCATATACCACACAAGGCGGAGGAATGAAGATTGCAGCATTACCAACACAGATGAGACC 
AACTTGCCAGCAGATATTCCTAGTTATTTGTCATCTCAGGGAACGTTAGCTGACAGGCAGGA 
TGGGTACGTGTCTTCAGAAAGTGGAAGCCACCACCAGTTTGTCACATCTTCAGGTGCTGGAT 
TTTTCTTACCACAACATGACAGTAGTGGGACCTGCCATATTGACAATAGCAGTGAAGCTGAT 
GTGGAAGCTGCCACAGATCTGTTCCTTTGTCCGTTTTTGGGATCCACAGGCCCTATGTATTT 
GAAGGGAAATGTG7ATGGCTCAGATCCTTTTGAAACATATCATACAGGTTGCAGTCCTGACC 
CAAGAACAGTTTTAATGGACCACTATGAGCCCAGTTACATAAAGAAAAAGGAGTGCTACCCA 
TGTTCTCATCCTTCAGAAGAATCCTGCGAACGGAGCTTCAGTAATATATCGTGGCCTTCACA 
TGTGAGGAAGCTACTTAACACTAGTTACTCTCACAATGAAGGACCTGGAATGAAAAATCTGT 
GTCTAAACAAGTCCTCTTTAGATTTTAGTGCAAATCCAGAGCCAGCGTCGGTTGCCTCGAGT 
A^.TTCTTTCATGGGTACCTTTGGAAAAGCTCTCAGGAGACCTCACCTAGATGCCTATTCAAG 
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FIGTJRE 105B 

CTTTGGACAC-CCATCAGATTGTCAGC'CAAGAGCCTTTTATTTGAAAGCTCATTCrrCCCCAG 
A GTTGGA CTCr GGGT CAGAGGAAGATGGGAAAGAAAGGACAGATTTTCAGGAAGAAAAr CAC 
ATTTGTACCTTTAAACAGACTTTAGAAAACTACAGGACTCCAAATTTTCAGTCTTATGACTT 
GGACACATAGACTGAATGAGACCAAAGGAAAAGCTTAJICATACTACCTCAAGTGAACTTTTA 
TTTAAAAGAGAGAGAATCTTATGTTTTTTAAATGGAGTTATGAATTTTAAAAGGATAAAAAT 
GCTTTATTTATACAGATGAACCAAAATTACAAAAAGTTATGAAAATTTTTATACTGGGAATG 
ATGCTCATATAAGAATACCTTTTTAAACTATTTTTTAACTTTGTTTTATGCAAAAAAGTATC 
TTACGTAAATTAATGATATAAATCATGATTATTTTATGTATTTTTATAATGCCAGATTTCTT 
TTTA7GGAAAATGAGTTACTAAAGCJi.TTTTAAATAATACCTGCCTTGTACCATTTTTTAAAT 
AGAAGTTACTTCATTATATTTTG'CACATTATATTTAATAAAATGTGTGAATTTGAAAAAAAA 
AA>-AAAAAAAAAAAAAAAAAAAA 
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FTGTIRF. lOfi 

x/usr/ secdb2/£St/DNA/Dnaseqs .r.in/ss . DNA3 714 : 
xsuhunit: 1 cr 1 , 1119 aa , 1 stop 
><m: 123434, pi: 6. OS, NX{S/7}: 12 

M£AJ=£LRA?J^Jir-LGLLLCAVLGRAGRSDSGGRGSLGOPSGVAAZR?CPTTCRCLG^ 
SRKR1AJILPEFLPSWVARLDLSHLIRLSFIKASSMSFXQSI^REV:<I^^ 

axitilsij^gnriv£ilpehlkefqsletijlssott:sslotafpalqlkylyi.nsnrvt 

Sr/H?GYrDl.XArTTLLVLKLNRNRISAIPPKMFKLPQLQ:^ELNRN:<IKN^^ 
LKSLKJ/QRNGVT}a.KDGAFWGLSN>lEILQI^HNNLTEITKGWLYGLLMLQEIJIL£QN 

rispdavzefcqklseldltfnhlsrlddssflglsllktlkignnrvsyiadcafrglss 

LKTLDLWEISWTIEDr^^GAFSGLDKLRRLILQGNRIRSITKKAFTGIZ^AI^ 
AIKSLQGNAF£Q^rraGLQCLKLNTSSLLCDCQLK>^LPQWAENNFQSFVNASCAHPQLLK^ 

rsifavspdgfvcddffkpqitvqpetqsaikgsnlsficsaasssdspmtfawkkdnel 

LHDAEME^^fAHLRAQGGEVMEYTTILRLREVEFASEGKyQCVISNKFGSSYSVKAKLTVN 
MI.?SFTKTP^mLTIRAGAJ^ARLECAAVGHPAPQIAWQKDGGTDFPAARERRMHVMPEDDV 
FFITOVTCIEDIG^/YSCTAQNSAGSISANATLTVLSTPSFIJIPIJ^RTVTKGSTAVLQCIA 
GGSPPPKLNVTIODSFLVVTERHFFAAGNQLLIIVDSDVSDAGKyrCEMSNTLGTERGNV 
RLS VI PTPTCDS PQMTAPSLDDDGWATVGWI I AWCCWGTSLVWWI I YHTRRRNEDC 

SITKTDETNLPADIPSYLSSQGTLADRQDGYVSSESGSHHQFVTSSGAGFFLPQHDSSGT 
CHIDKSSEADVSAATDLFLCPFLGSTGPMYLKGNVYGSDPFETYin'GCSPDPRTVI^HY 

e?£yik:c<ecypcsk?seescersfsniswpshvrklij^syshnegpgmknlc^^ 

DFSANPSPASVASSNSFKGTFGKA-LRRFHLDAYSSFGQPSDCQPRAFYLKAHSSPDLDSG 

seedgkertdfqeenkictfkqtlenyrtpnfqsydldt 
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FIGURKinTA 

C^^_^^.CTTGCGTCGCGGAGAGCGCCCi^.GCTTGACTTG;^-:^.TGC-;iAGG^^^^^ 

Gs.u.-A^'CTGAor^XTGGGGGAGCGCGTTCGGCCTGTGGGGCGCCGCTCGGCGCCGGGGCGCAG 

CAGGG^-AGGGG.-A.GCTGTGGTCTGCCCTGCTCCACGAGGCGCCA.CTGGTGTGAACCGG-GAG.:. 

AGCGGCG.AGGCA.GGAGGGCTCATGGTGAGCAAGGAGGCCGGCTGATCTGCAGGCGCACAGCA 
TTCCGAGTTTACAGATTTTTACAGATACCAA^.TGGAAGGCGAGGAGGCAGAACAGCCTGCCT 
GGTTCCATCAGCCCTGGCGCCCAGGCGCATCTGACTCGGCACCCCCTGCAGGCACCATGGCC 
CAGAGCCGGGTGCTGCTGCTCCTGCTGCTGCTGCCGCCACAGCTGCACCTGGGAGCTGTGCT 
TGCCG7GAGGGCCCCAGGATTTGGCCGAAGTGGCGGCCACAGCCTGAGCCCCGAAGAGAACG 
^-^.TTTGCGGAGGAGGAGCCGGTGCTGGTACTG;i.GCCCTGAGGAGCCCGGGCCTGGCCC:AGCC 
GCG'oTCAGCTGCCCCCGAGACTGTGCCTGTTCCCAGGAGGGCGTCGTGGACTGTGGCGGTAT 
TGACCTGCGTGAGTTCCCGGGGGAGCTGCCTGAGCACACCAACCACCTATCTCTGCAGAACA 
ACCAGCTGGAAAAGATCTACCCTGAGGAGC7CTCCCGGCTGCACCGGCTGGAGACACTGAAC 
CTGCAAAACAACCGCCTGACTTCCCGAGGGCTCCCAGAGAJ^C-GCGTTTGAGCATCTGACCAA 
CCTCA;^.TTACCTG7ACTTGGCCAJITAACAAGCTGACCTTGGCACCCCGCTTCCTGCCAAACG 
CCCTGATCAGTGTGGACTTTGCTGCCAACTATCTCACCAAGATCTATGGGCTCACCTTTGGC 

cagaagccaaacttgaggtctgtgtacctgcacaacaacaagctggcagacgccgggctgcc 
ggacaacatgttcaacggctccagcaacgtcgaggtcctcatcctgtccagcaacttcctgc 
gcc^cgtgcccap.gcacctgccgcctgccctgtacaagctgcacctcaagaacaacaagctg 
gagaagatccccccgggggccttcagcgagctgagcagcctgcgcgagctatacctgcagaa 

CAACTACCTGACTGACGAGGGCCTGGACAACGAGACCTTCTGGAAGCTCTCCAGCCTGGAGT 
ACCTGGA7CTGTCCAGCAACAACCTGTCTCGGGTCCCAGCTGGGCTGCCGCGCAGCCTGGTG 
C7GC7GCAC77GGAGAAGAACGCCATCCGGAGCG7GGACGCGAATGTGCTGACCCCCATCCG 
CAGCC7GGAG7ACC7GCTGCTGCACAGCAACCAGCTGCGGGAGCAGGGCATCCACCCACTGG 
CCT7CCAGGGCC7CAAGCGG77GCACACGGTGCACCTGTACAACAACGCGCTGGAGCGCGTG 
CCCAG7GGCC7GCCTCGCCGCG7GCGCACCCTCATGA7CC7GCACAACCAGATCACAGGCAT 
7GGCCGCGAAGAC7TTGCCACCACC7AC7TCCTGGAGGAGCTCAACC7CAGCTACAACCGCA 
TCACCAGCCCACAGGTGCACCGCGACGCC7TCCGCAAGCTGCGCC7GCTGCGCTCGCTGGAC 
CTGTCGGGCAACCGGCTGCACACGCTGCCACCTGGGCTGCC7CGAAATGTCCATGTGCTGAA 
GGTCAAGCGCAA7GAGC7GGCTGCCTTGGCACGAGGGGCGC7GGCGGGCATGGCTCAGCTGC 

GACC7CGCCCA7C7GCAGC7GCTGGACA7CGCCGGGAA7CAGCTCACAGAGATCCCCGAGGG 
o.TCC^C.~aG7CAC7TGAG7ACC7G7ACCTGCAGAACAACAAGA77AG7GCGGTGCCCGCCA 
GCC. 7.GACTCCACGCCCAACC7CAAGGGaA7CT7TC7CAGG77TAACAAGCTGGC7G7G 




^";::^^^^:-^f^^^^f^T^GCCA7AGCAGC7CGCCG7CTGCCCTGTCCATCTGTCCGTC 
A.^GGAuA^C.....T^...iGC7GC7GGGGCCT7GGGGCAGGAGTGAAGCAGAGGTGA7GGGG 
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FIGURE 107R 

C T G C- C- C T G A G C C AG G GAG G AA.G G AC C C AG G T G C AC C TAG G AGACAC C CCCGCTCrT CPlGGC C 
C GTGGGGGAAGT T CCGGGTGCCTTTATT TTTT ATT CTT 7TC7AJ^.C-GA_-JsJ^A;^J^.TGA-': AAAAA 
TCTGAAJ^.GCTGATTTrTCTTGTTATAGAAAAACTAATATA.-J^AGCAT7ATCCCTATCCCTGC 
J^AJiA-^-AAAA-E-. 
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FTGITRF, ins 



Kez 


Glu 


Gly 


Glu 


Glu 


Ala 


Glu 


Gin 


Pro 


Ale 


Trp 


Phe 


His 


Gin 


Pre 


Trp 
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Pro 


Gly 
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Ser 
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Ser 
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Gly 


Thr 
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PA^ 
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Ser 
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Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 


Pro 


Pro 
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Leu 


His 


Leu 


Gly 
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Val 
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Gly 
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Leu 
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FIGURE 109 

gg-c-agggggctccggg"gccgcg'cagcagacctactccg3ccg2gcgcctcgccgctgtc 
crccgggagcggcagcagtagcccgggcggcgagggctgggggttcctcgagactctcag 
aggggcgccrcccatcggcggccacai^ccccaacctgttcctcgcgggccagtgcgctgc 
gccccaggacccggtgcccaacatggattttctcctggcgctggtgctggtatcctcgct 
ctacct gcaggcgg ccg ccgagttcgacgggaggtggcccagg caaatagtgtcatcgat 
tggcctatgtcgttatggtgggaggattgactgctgctggggctgggctcgccagtcttg 
gggacagtgtcagcctgtgtgccaaccacgatgcaaacatggtgaatgtatcgggccaa;^. 
caagtgcaagtgtcatcctggttatgctggaaaaacctgtaatcaagatctaaatgagtg 
tggcctgaagccccggccctgtaagcacaggtgcatgaacacttacggcagctacaagtg 
ctactgtctcaacggatatatgctcatgccggatggttcctgctcaagtgccctgacctg 
ctccatggcaaactgtcagtatggctgtgatgttgttaaaggacaaatacggtgccagtg 

CCCATCCCCTGGCCTGCACCTGGCTCCTGATGGGAGGACCTGTGTAGATGTTGATGAATG 

tgctacaggaagagcctcctgccctagatttaggcaatgtgtcaacacttttgggagcta 
CAT C7G caagtgt cataaaggcttcgatctcatgtatattggaggcaaatatcaatgtca 
tgacatagacgaatgctcacttggtcagtatcagtgcagcagctttgctcgatgttataa 
cgtacgtgggtcctacaagtgcaaatgtaaagaaggataccagggtgatggactgacttg 
tgtgtatatcccaaaagttatgattgaaccttcaggtccaattcatgtaccaaagggaaa 
tggtaccattttaaagggtgacacaggaaataataattggattcctgatgttggaagtac 
ttggtggcctccgaagacaccatatarrcctcctatcattaccaacaggcctacttctaa 

GCCAACAACAAGACCTACACCAAAGCCAACACCAATTCCTACTCCACCACCACCACCACC 
CCTGCCAACAGAGCTCAGAACACCTCTACCACCTACAACCCCAGAAAGGCCAACCACCGG 
ACTGACAACTATAGCACCAGCTGCCAGTACACCTCCAGGAGGGATTACAGTTGACAACAG 
GGTACAGACAGACCCTCAGAAACCCAGAGGAGATGTGTTCAGTGTTCTGGTACACAGTTG 
TAATTTTGACCATGGAC7TTGTGGATGGATCAGGGAGAAAGACAATGACTTGCACTGGGA 
ACCAATCAGGGACCCAGCAGGTGGACAATA-TCTGACAGTGTCGGCAGCCAAAGCCCCAGG 
GGGAAAAGCTGCACGCTTGGTGCTACCTCTCGGCCGCCTCATGCATTCAGGGGACCTGTG 
CC7GTCATTCAGGCACAAGGTGACGGGGCTGCACTCTGGCACACTCCAGGTGTTTGTGAG 
;iJlAACACGGTGCCCACGGAGCAGCCCTGTGGGGAAGAAATGGTGGCCATGGCTGGAGGCA 
AACACAGATCACCTTGCGAGGGGCTGACATCAAGAGCGAATCACAAAGATGATTAAAGGG 
TTGGAA-AAAAAGATCTATGATGGAAAATTAAAGGAACTGGGATTATTGAGCCTGGAGAAG 
AGAAGACTGAGGGGCAAACCATTGATGGTTTTCAAGTATATGAAGGGTTGGCACAGAGAG 
GGTGGCGACCAGCTGTTCTCCATATGCACTAAGAATAGAACAAGAGGAAACTGGCTTAGA 
CTAGAGTATAAGGGAGCATTTCTTGGCAGGGGCCATTGTTAGAATACTTCATAAAAAAAG 
AA.GTGTGAAAATCTCAGTATCTCTCTCTCTTTCTAAAAAATTAGATAAAAATTTGTCTAT 
■TTAAGATGGTTAAAGATGTTCTTACCCAAGGAAAAGTAACAAATTATAGAATTTCCCAAA 
AGATGTT7TGATCCTACTAGTAGTATGCAG7GAAAATCTTTAGAACTAAATAATTTGGAC 
AAGGC7TAA7TTAGGC:ATTTCCC7C77GACCTCC7AATGGAGAGGGATTGAAAGGGGAAG 
AGCCCACCAAATGCTGAGCTCACTGAAATATCTC7CCCTTATGGCAA7CCTAGCAGTATT 
AAAGAAAAAAGGAAACTATTTATTCCAAA7GAGAGTATGATGGACAGATA7TTTAGTATC 
TCAGTAA7GTCCTAGTGTGGCGGTGGTTTTCAATGTTTCTTCATGGTAAAGGTATAAGCC 

tttcatttgttcaatggatgatgtttcagatttttttttttttaagagatccttcaagga 

ACAC:AG7TCAGAGAGATTTTCATCGGG7GCATTCTCTCTGCTTCGTG7GTGACAAGTTAT 
CTTGGCTGC7GAGAAAGAGTGCCC7GCCCCACACCGGCAGACCTT7CCTTCACCTCATCA 
GTA7GATTCAGTT7CTCTTA7CAAT7GGACTC7CCCAGGTTCCACAGAACAGTAATATTT 
TTTGAAC:AATAGG7ACAATAGAAGGTCTTC7GTCATTTAACCTGG7AAAGGCAGGGCTGG 
AGGGGGAAAATAAATCATTAAGCCTTTGAGTAACGGCAGAATATATGGCTGTAGA7CCAT 
TTTTAA7GGTTCA7TTCCTTTATGGTCATATAACTGCACAGCTGAAGATGAAAGGGGAAA 
ATAAATGAAAAT7TTACTTT7CGATGCCAATGATACATTGCACTAAACTGATGGAAGAAG 

T T AG TGG7TTTCCAAATGG CCT AATAAAAACAA7T ATTTGT AAATAAAAACACTG7TAG7 
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FIGURE 110 

xsignal pept:ide> 
MDFLLAl-VLVS S L YLQA 
xstiart mature protein> 

AT^S TDGRW ? RQ I VS S 1 3LCRYGGR I DC CWGW?J^QS WGQCQPV 
XEGF-like repeats, 60-253> 

CQFRCKKGECIGPNKCKCH?GYAGKTCNQDLN£CGLKPRPCKHRCMNTYGSYKCYCLNGYML 

MPDGSCS3ALTCSHANCQYGCDVVKGQIRCQCPSPGLHLAPDGRTCVDVDECATGRASCPRF 

RQCVK7FGSYICKCHKGFDLMYIGGKYQCHDIDECSLGQVCCSSFARCYNVRGSYKCKCKFG 

YQGDGLTCVTIPKVMIEPSGPInVPKG 

xpotential N-glycosyiation site> 

NGTILKGDTGNmmiPDVGSTWWPPKTPYIPFIIiNRPTSKPTTRPTPKPTPIPTPPFPFFL 
PTELRTPLPFTTPERPTTGLTTIAPAASTPPGGITVDNRVQTDPQKPRGDVFSVLVHSCNFD 
HGLCGWIREKDblDLHWEPIRDPAGGQYLTVSAAKAPGGKAARLVLPLGRLMHSGDLCLSFRH 
KVTGLHSGTI.QVrV?.KHGAF:GAALWGRNGGHGWRQTQITLKGADIKSESQR " • 
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FIGURE m 

CTTCrrT3A;iJ^AGGATTATCACCTGATCAC-GTTCICTCTGCATTTGCCCCTTTAGAT'^GTG^ 
AATGrGGGTCAi^.GGTCTTCACAsCTTTCCTTTCCTTTGG-A^CAGGTGCTTGCTCGGGGCTGA 
AGGTGACAGTGCCATCACACACTGTCCATGGCGTGAGAGGrCAGGCCCTCTACCTACCCGTC 
CACTA7GGCTTCCACACTCCAGCATCAGACATCCAGATCATATGGCTATTTGAGAGACCCCA 
CACi^ATGCCCAJiATACTTACTGGGCTCTGTGAATAAGTCTGTGGTTCCTGACTTGG/^A.TACC 
AACAC-2AGTTCACCATGATGCCACCCPATGCATCTCTGCTTATCAACCCACTGCAGTTCCCT 
GAT GAAG G C-?A.T TAG AT CGT GA^i.GGTCAACAT T CAGGGAAATGGAACTCTATCTGCCAGTCA 
GAAGArACAAGTCAGGGTTGATGATCCTGTCACAAAGCCAGTGGTGCAGATTCATCCTCCCT 
CTGGGGCTGTGGAGTATGTGGGG^ACATGACCCTGACATGCCATGTGGAAGGGGGCACTCGG 
C T AG C T TAG CMTG G C TAAA=AATGGGAGAC C TGTCCACACCAGC TCCACC TAG "^CC T'T TTC 
TCCCC;iAAAC;iJ^TACCCTTCATATTGCTCCAGTAACCAAGGAAGACATTGGGAATTACAGCT 
G C C T G G T GAG G AAC CCTGTCAGr GAAAT G GAAAG T GATAT CATTAT G C C CAT CAT AT AT TAT 
G G AC C T TAT G G AC T T CAAG T GAAT T C T GAT AAi^GGG C TAAAAGTAGGGGAAGTGT T TAC TG T 
TGACCTTGGAGAGGCCATCCTATTTGATTGTTCTGCTGATTCTCATCCCCCCAACACCTACT 
CCTGGATTAGGAGGACTGACAATACTACATATATCATTAAGCATGGGCCTCGCT'TAGAAGT'T 
GCATC7GAGAAAGTAGCCCAGAAGACAATGGACTATGTGTGCTGTGCTTACAACAACATAAC 
CGGCAGGCAAGATGAAACTCATTTCACAGTTATCATCACTTCCGTAGGACTGGAGAAGCTTG 
CACAGAAAGGAAAATCATTGTCACCTTTAGCAAGTATAACTGGAATATCACTATTTTTGATT 
ATATCCATGTGTCTTCTCTTCCTATGGAAAAAATATCAACCCTACAAAGTTATAAAACAGAA 
ACTAGAAGGCAGGCCAGAAACAGAATACAGGAAAGCTCAAACATTTTCAGGCCATGAAGATG 
CTCTGGATGACTTCGGAATATATGAATTTGTTGCTTTTCCAGATGTTTCTGGTGTTTCCAGG 
ATTCCAAGCAGGTCTGTTCCAGCCTCTGATTGTGTATCGGGGCAAGATTTGCACAGTACAGT 
GTATGAAGTTATTCAGCACATCCCTGCCCAGCAGCAAGACCATCCAGAGTGAACTTTCATGG 
GCTAAACAGTACATTCGAGTGAAATTCTGAAGA^J^CATTTTAAGGAAAAACAGTGGAAAAGT 
ATATTAATCTGGA.2TCAGTGAAGA7VACCAGGACCAACACCTCTTACTCATTATTCCTTTACA 
TGCAGAATAGAGGCATTTATGCAJiA.TTGAJVCTGCAGGTTTTTCAGCATATACACAATGTCTT 

GTGCAACAGA?UiAACATGTTGGGGAAATATTCCTCAGTGGAGAGTCGTTCTCATGCTGACGG 
GGAG^A.CGAAJVGTGACAGGGGTTTCCTCATAAGTTTTGTATGAAATATCTCTACAAACCTCA 
ATTAGTTCTACTCTACACTTTCACTATCATCAACACTGAGACTATCCTGTCTCACCTACAAA 
TGTGGAAACTTTACATTGTTCGATTTTTCAGCAGACTTTGTTTTATTAAATTTTTATTAGTG 
TT;iAGAATGCTAAATTTATGTTTCAATTTTATTTCCAAATTTCTATCTTGTTATTTGTACAA 
p^A.GTAATAAGGATGGTTGTCACAAAAACAAAACTATGCCTTCTCTTTTTTTTCAATCACC 
AG T AGTAT T T T TGAGAAGACTTGTGAACACTTAAGGAAATGACTATTAAAGTCTTATTTTTA 
TTTTTTTCAAGGAAAGATGGATTCAAA.TAAATTATTCTGTTTTTGCTTTTAAAAAAAAAAAAAA 
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FTGintF ip 

Me- Tr? Leu Lys Val ?he Tnr Thr Fhe Leu Ser Pae Ala :-r Gly Ala 
Cys Ser Giy Leu Lys Val Thr Val Fro Ser Kis Thr Val His Gly Val 
Arc Gly Gin A_a Leu Tyr Leu Pro Val His Tyr Gly Phe His Thr Pre 
Ala Ser Asp lie Gin He He Trp Leu Phe Glu Arg Pro His Thr Mez 
Pro Lys Tyr Leu Leu Gly Ser Val Asn Lys Ser Val Val Pro Asp Leu 
Glu Tyr Gin His Lys Phe Thr Met Net Pro Pro Asn Ala Ser Leu Leu 
lie Asn Pre Leu Gin Phe Pro Asp Glu Gly Asn Tyr He Val Lys Val 
Asn lie Gin Gly Asn Gly Thr Leu Ser Ala Ser Gin Lys He Gin Val 
Tnr Val Asp Asp Pro Val Thr Lys Pro Val Val Gin He His Pre Pro 
Ser Gly Ma Val Glu Tyr Val Gly Asn Met Thr Leu Thr Cys His Val 
Glu Gly Gly Thr Arg Leu Ala Tyr Gin Trp Leu Lys Asn Gly Arg Pro 
Val His Thr Ser Ser Thr Tyr Ser Phe Ser Pro Gin Asn Asn Thr Leu 
His He Ala Pro Val Thr Lys Glu Asp He Gly Asn Tyr Ser Cys Leu 
Val Arg Asn Pro Val Ser Glu Met Glu Ser Asp He He Met Pro He 
He Tyr Tyr Gly Pro Tyr Gly Leu Gin Val Asn Ser Asp Lys Gly Leu 
Lys Val Gly Glu Val Phe Thr Val Asp Leu Gly Glu Ala He Leu Phe 
Asp Cys ser Ala Asp Ser His Pro Pro Asn Thr Tyr Ser Trp He Arg 
Arc Thr Asp Asn Thr Thr Tyr He He Lys His Gly Pro Arg Leu Glu 
Val Ala Ser Glu Lys Val Ala Gin Lys Thr Met Asp Tyr Val Cys Cys 
Ala Tyr Asn Asn He Thr Gly Arg Gin A^p Glu Thr His Phe Thr Val 
He He Thr Ser Val Gly Leu Glu Lys Leu Ala Gin Lys Gly Lys Ser 
Leu Ser Pro Leu Ala Ser He Thr Gly He Ser Leu Phe Leu He He 
Ser Met Cys Leu Leu Phe Leu Trp Lys Lys Tyr Gin Pro Tyr Lys Val 
He Lys Gin Lys Leu Glu Gly Arg Pro Glu Thr Glu Tyr Arg Lys Ala 
Gin Thr Phe Ser Gly His Glu Asp Ala Leu Asp Asp Phe Gly He Tyr 
Glu Phe Val A^a Phe Fro Asp Val Ser Gly Val Ser Arg He Pro Ser 
Ser Val Pro Ala Ser Asp Cys Val Ser Gly Gin Asp Leu His Ser 
Thr Val Tyr Glu Val He Gin His He Pro Ala Gin Gin Gin Asp His 
Pre Glu 
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FIGTJRE m 

G C AAG C G G CGAA 

xKET {trans = l-s, dir=f, re5 = l} 

ATGGCGCCCTCCGGGAGTCTTGCAGTTCCCCTGGCAGTCCTGGTGCTGTTGCTTTGGGGT 
GGTCCCTGGACGCACGGGCGGCGGAGCAACGTTCGCGTCATCACGGACGAGAACTGGAGA 
GAACTGCTGGAAGGAGACTGGATGATAGAATTTTATGCCCCGTGGTGCCCTGCTTGTCAA 
AATCTTCAACCGGAATGGGAAAGTTTTGCTGAATGGGGAGAAGATCTTGAGGTTAATATT 
GCGAAAGTAGATGTCACAGAGCAGCCAGGACTGAGTGGACGGTTTATCATAACTGCTCTT 
CCTACTATTTATCATTGTAAAGATGGTGAATTTAG3CGCTATCAGGGTCCAAGGAGTAAG 
AAGGACTTCATAAACTTTATAAGTGATAAAGAGTGGAAGAGTATTGAGCCCGTTTCATCA 
TGGTTTGGTCCAGGTTCTGTTCTGATGAGTAGTATGTCAGCACTCTTTCAGCTATCTATG 
TGGATCAGGACGTGCCATAACTACTTTATTGAAGACCTTGGATTGCCAGTGTGGGGATCA 
TATAGTGTTTTTGCTTTAGCAACTCTGTTTTCCGGACTGTTATTAGGACTCTGTATGATA 
TTTGTGGCAGATTGCCTTTGTCCTTCAAAAAGGCGCAGACCACAGCCATACCCATACCCT 
TCAAAAAAATTATTATCAGAATCTGCACAACCTTTGAAAAAAGTGGAGGAGGAACAAGAG 
GCGGATGAAGAAGATGTTTCAGAAGAAGAAGCTGAAAGTAAAGAAGGAACAAACAAAGAC 
TTTCCACAGAATGCCATAAGACAACGCTCTCTGGGTCCATCATTGGCCACAGATAAATCC 
TAGTTAAATTTTATAGTTATCTTAATATTATGATTTTGATAAAAACAGAAGATTGATCAT 
TTTGTTTGGTTTGAAGTGAACTGTGACTTTTTTGAATATTGCAGGGTTCAGTCTAGATTG 
TCATTAAATTGAAGAGTCTACATTCAGAACATAAAAGCACTAGGTATACAAGTTTGAAAT 
ATGATTTAAGCACAGTATGATGGTTTAAATAGTTCTCTAATTTTTGAAAAATCGTGCCAA 
GCAATAAGATTTATGTATATTTGTTTAATAATAACCTATTTCAAGTCTGAGTTTTGAAAA 
TTTACATTTCCCAAGTATTGCATTATTGAGGTATTTAAGAAGATTATTTTAGAGAAAAAT 
ATTTCTCATTTGATATAATTTTTCTCTGTTTCACTGTGTGAAAAAAAGAAGATATTTCCC 
ATAAATGGGAAGTTTGCCCATTGTCTCAAGAAATGTGTATTTCAGTGACAATTTCGTGGT 
CTTTTTAGAGGTATATTCCAAAATTTCCTTGTATTTTTAGGTTATGCAACTAATAAAAAC 
TACCTTACATTAATTAATTACAGTTTTCTACACATGGTAATACAGGATATGCTACTGATT 
TAGGAAGTTTTTAAGTTCATGGTATTCTCTTGATTCCAACAAAGTTTGATTTTCTCTTGT 
ATTTTTCTTACTTACTATGGGTTACATTTTTTATTTTTCAAATTGGATGATAATTTCTTG 
GAAACATTTTTTATGTTTTAGTAAACAGTATTTTTTTGTTGTTTCAAACTGAAGTTTACT 
GAGAGATCCATCAAATTGAACAATCTGTTGTAATTTAAAATTTTGGCCACTTTTTTCAGA 
TTTTACATCATTCTTGCTGAACTTCAACTTGAAATTGTTTTTTTTTTCCTTTTGGATGTG 
AAGGTGAACATTCCTGATTTTTGTCTGATGTGAAAAAGCCTTGGTATTTTACATTTTGAA 
AATTCAAAGAAGCTTAATATAAAAGTTTGCATTCTACTCAGGAAAAAGCATCTTCTTGTA 
TATGTCTTAAATGTATTTTTGTCCTCATATACAGAAAGTTCTTAATTGATTTTACAGTCT 
GTAATGCTTGATGTTTTAAAATAATAACATTTTTATATTTTTTAAAAGACAAACTTCATA 
TTATCCTGTGTTCTTTCCTGACTGGTAATATTGTGTGGGATTTCACAGGTAAAAGTCAGT 
AGGATGGAACATTTTAGTGTATTTTTACTCCTTAAAGAGCTAGAATACATAGTTTTCACC 
TTAAAAGAAGGGGGAAAATCATAAATACAATGAATCAACTGACCATTACGTAGTAGACAA 
TTTCTGTAATGTCCCCTTCTTTCTAGGCTCTGTTGCTGTGTGAATCCATTAGATTTACAG 
TATCGTAATATACAAGTTTTCTTTAAAGCCCTCTCCTTTAGAATTTAAAATATTGTACCA 
TTAAAGAGTTTGGATGTGTAACTTGTGATGCCTTAGAAAAATATCCTAAGCACAAAATAA 
ACCTTTCTAACCACTTCATTAAAGCTGAAAAAAAAAAAAAAAAAA 
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FIGURE 114 

KJl?S G£ LAV?LA'v'L'v1jLLWGAPWTKG?.RSNVRVI TDENW?3L 

WES FAEWGZDLEVN: AKV^^^^'EQP 3LSGRF 1 1 TALFT I ^-HCKDGE? 
KDr I N7 1 SDKS WKS I S PVS S W?G?GS VI>:S SMS ALFQLSMWI RTC-iNYF I EDLGL? VKGS 
YTVFALATLFSGLLLGLCMIFVADCLCPSKRRRPQFYPYPSKKLLSESAQPLKKVZEEQE 
ADEEBVSEEEAESKSG7NKDFPQNAIRQRSLGPSLATDKS 



120 ; 128 



wo 99/14328 



PCT/U598/19330 



FIGTJRKIIS 

GCGAGTGTCCAGCTGCGC-AGACCCGTGATAATTCGTTAACTAATTCAACAAACGGGACCC 
TTCTGTGTGCCAGAAACCGCAAGCAGTTGCTAACCCAGTGGGACAGGCGGATTGGAAGAG 
CGGGAAGGTCCTGGCCCAGAGCAGTGTGAGACrrCCCTCTGTGACC 
x.MET -{ trans = l , dir=f, res = l} 

ATGAAACTCTGGGTGTCTGCATTGGTaATGGCCTGGTTTGGTGTCCTGAGCTGTGTGCAG 

GCCGAATTCTTCACCTCTATTG3GGACATGACTGACCTGATTTATGCAGAGAAAGAGCTG 

GT3CAGTCTCTGAAAGAGTACATCCTTGTGGAGGAAGCCAAGCTTTCCAAGATTAAGAGC 

TGGG CCAACAAAATGGAAG C CTTGACTAG CAAGTCAGCTGCTGATGCTGAGGGCTAC CTG 

GCTCACCCTGTGAATGCCTACAAACTGGTGAAGCGGCTAAACACAGACTGGCCTGCGCTG 

GAGGACCTTGTCCTGCAGGACTCAGCTGCAGGTTTTATCGCCAACCTCTCTGTGCAGCGG 

CAGTTCTTCCCCACTGATGAGGACGAGATAGGAGCTGCCAAAGCCCTGATGAGACTTCAG 

GACACATACAGGCTGGACCCAGGCACAATTTCCAGAGGGGAACTTCCAGGAACCAAGTAC 

CAGG CAATG CTGAGTGTG GATG ACTGCTTTGGGATGGGCCGCTCGGCCTACAATGAAGGG 

GACTATTATCATACGGTGTTGTGGATGGAGCAGGTGCTAAAGCAGCTTGATGCCGGGGAG 

GAGGCCACCACAACCAAGTCACAGGTGCTGGACTACCTCAGCTATGCTGTCTTCCAGTTG 

GGTGATCTGCACCGTGCCCTGGAGCTCACCCGCCGCCTGCTCTCCCTTGACCCAAGCCAC 

GAACGAGCTGGAGGGAATCTGCGGTACTTTGAGCAGTTATTGGAGGAAGAGAGAGAAAAA 

ACGTTAACAAATCAGACAGAAGCTGAGCTAGCAACCCCAGAAGGCATCTATGAGAGGCCT 

GTGGACTACCTGCCTGAGAGGGATGTTTACGAGAGCCTCTGTCGTGGGGAGGGTGTCAAA 

CTGACACCCCGTAGACAGAAGAGGCTTTTCTGTAGGTACCACCATGGCAACAGGGCCCCA 

CAGCTGCTCATTGCCCCCTTCAAAGAGGAGGACGAGTGGGACAGCCCGCACATCGTCAGG 

TACTACGATGTCATGTCTGATGAGGAAATCGAGAGGATCAAGGAGATCGCAAAACCTAAA 

CTTGCACGAGCCACCGTTCGTGATCCCAAGACAGGAGTCCTCACTGTCGCCAGCTACCGG 

GTTTCCAAAAGCTCCTGGCTAGAGGAAGATGATGACCCTGTTGTGGCCCGAGTAAATCGT 

CGGATGCAGCATATCACAGGGTTAACAGTAAAGACTGCAGAATTGTTACAGGTTGCAAAT 

TATGGAGTGGGAGGACAGTATGAACCGCACTTCGACTTCTCTAGGCGACCTTTTGACAGC 

GGCCTCAAAACAGAGGGGAATAGGTTAGCGACGTTTCTTAACTACATGAGTGATGTAGAA 

GCTGGTGGTGCCACCGTCTTCCCTGATCTGGGGGCTGCAATTTGGCCTAAGAAGGGTACA 

GCTGTGTTCTGGTACAACCTCTTGCGGAGCGGGGAAGGTGACTACCGAACAAGACATGCT 

GCCTGCCCTGTGCTTGTGGGCTGCAAGTGGGTCTCCAATAAGTGGTTCCATGAACGAGGA 

CAGGAGTTCTTGAGACCTTGTGGATCAACAGAAGTTGACTGACATCCTTTTCTGTCCTTC 

CCCTTCCTGGTCCTTCAGCCCATGTCAACGTGACAGACACCTTTGTATGTTCCTTTGTAT 

GTTCCTATCAGGCTGATTTTTGGAGAAATGAATGTTTGTCTGGAGCAGAGGGAGACCATA 

CTAGGGCGACTCCTGTGTGACTGAAGTCCCAGCCCTTCCATTCAGCCTGTGCCATCCCTG 

GCCCCAAGGCTAGGATCAAAGTGGCTGCAGCAGAGTTAGCTGTCTAGCGCCTAGCAAGGT 

GCCTTTGTACCTCAGGTGTTTTAGGTGTGAGATGTTTCAGTGAACCAAAGTTCTGATACC 

TTGTTTACATGTTTGTTTTTATGGCATTTCTATCTATTGTGGCTTTACCAAAAAATAAAA 
T GT C C CT AC C AGAAAJLAAAAAA 
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FIGTJRE 116 

xstgaal peptide> 
MKLW^^SALLMAWFGVLS 
><stan mature pro:ein> 

CVQAEFFrSlGH\rroUYAEK£LVQSIiCEYILVEEAKLSKIKSWANK^ 
GYLAHPWAYKLVKRLNTDWPALEDLVLQDSAAGnA 
xN-glycosylation site.> 

M.SVQRQFFPTDEDElGAAK.M.MRLQDTYiaJ>PGTlSRGELPGTKYQAMLSVDDCF 

GMGRSAYTs:EGDYYHT\a.WMEQVLKQLDAGEEATTTKSQVLDY 

xpotential Leucine zipper 213-234> 

LSYAWQLGDLHRAI^LTRRLLSIJDPSHEIUGG>O.YFEQlXEEEREKTLT 
><N-glycosylation site.> 

KQTEAELATPEGIYERPNTDYLPERDVYESLCRGEGVKLTPRRQKRIJCRYHHGm 
QLLIAPFK£EDEWDSPmVRYYDVMSDEEIERIKEIAKPKLARATVRDPkTGVL 
YRVSKS S WLEEDDDP WARVNRRMQHITGLTVKTAEIXQVANYGVGGQYEPI^ 
RRPFDSGLKTEGNKLATFLN^ISDVEAGGATVPPDLGAAIWPKKGTAVFWYNU^ 
GEGDYKTRHAACPVLVGCKWVSNKWFHERGQEFLRPCGSTEVD 
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FIGURE 117 

GCAGTATTGAGTTTTACTTCCTCCTCTTTTTAGTGGAAGACAGACCATAATCCCAGTGTGAG 
TGAAA7TGATTGT7TCAT7TATTACCGTTTTGGCTGGGGGTTAGTTCCGACACCTTCACAGT 
TGAAGAGCAGGCAGAAGGAGTTGTGAAGACAGGACAATCTTCTTGGGGATGCTGGTCCTGGA 
AGCCAGCGGGCCTTGCTCTGTCTTTGGCCTCATTGACCCCAGGTTCTCTGGTTAAAACTGAA 
AGCCTACTACTGGCCTGGTGCCCATCAATCCATTGATCCTTGAGGCTGTGCCCCTGGGGCAC 
CCACCTGGCAGGGCCTACCACCATGCGACTGAGCTCCCTGTTGGCTCTGCTGCGGCCAGCGC 
TTCCCCTGATCTTAGGGCTGTCTCTGGGGTGCAGCCTGAGCCTCCTGCGGGT^CCTGGATC 
CAGGGGGAGGGAGAAGATCCCTGTGTCGAGGCTGTAGGGGAGCGAGGAGGGCCACAGAATCC 
AGATTCGAGAGCTCGGCTAGACCAAAGTGATGAAGACTTCAAACCCCGGATTGTCCCC^ACT 
ACAGGGACCCCAACAAGCCCTACAAGAAGGTGCTCAGGACTCGGTACATCCAGACAGAGCTG 
GGCTCCCGTGAGCGGTTGCTGGTGGCTGTCCTGACCTCCCGAGCTACACTGTCCACTTTGGC 
CGTGGCTGTGAACCGTACGG7GGCCCATCACTTCCCTCGGTTACTCTACTTCACTGGGCAGC 
GGGGGGCCCGGGCTCCAGCAGGGA7GCAGG7GGTGTCTCATGGGGA7GAGCGGCCCGCCTGG 
C7CA7G7CAGAGACCCTGCGCCACCTTCACACACACTTTGGGGCCGAC:TACGACTGGTTCTT 
CATCA7GCAGGATGACACA7ATG7GCAGGCCCCCCGCCTGGCAGCCCTTGCTGGCC:ACCTCA 
GCA7CAACCAAGACC7GTACTTAGGCCGGGCAGAGGAGTTCATTGGCGCAGGCGAGCAGGCC 
CGG7ACTGTCA7GGGGGC7TTGGCTACCTGTTGTCACGGAGTCTCCTGCTTCGTCTGCGGCC 
ACATC7GGATGGCTGCCGAGGAGACATTCTCAGTGCCCGTCCTGACGAGTGGCTTGGACGCT 
GCCTCA7TGAC7CTCTGGGCG7CGGCTGTGTCTCACAGCACCAGGGGCAGCAGTATCGCTCA 
TTTGAACTGGCCAAAAATAGGGACCC7GAGAAGGAAGGGAGCTCGGCTTTCCTGAGTGCCT7 




GTGC7GACCCCCGAAGGGGAGGCAGGGCTGAGCTGGCCCGTTGGGCTCCCTGCTCCTTTCAC 
ACCACAC7C7CGCrT7GAGG7GCTGGGCTGGGACTACTTCACAGAGCAGCACACCTTCTCCT 
G7GCAGA7GGGGCTCCCAAGTGCCCACTACAGGGGGCTAGCAGGGCGGACGTGGGTGATGCG 
T7GGAGACTGCCCTGGAGCAGC7CAATCGGCGCTA7CAGCCCCGCCTGCGCTTCCAGAAGCA 
GCGAC7GCTCAACGGC7A7CGGCGCTTCGACCCAGCACGGGGCATGGAGTACACCCTGGACC 




GCA.GC7GG7GCTGCCACTCCTGG7GGCTGAAGCTGC7GCAGCCCCGGCTTTCCTCGAGGCGT 
T7G CAG C C AA7 G 7 CC7GGAG CCACGAGAACA7GCArrGCTCACCCTGTTG CTGGTCTACGGG 
CCACGAGAAGGTGGCCGTGGAGCTCCAGACCCATT7CTTGGGGTGAAGGCTGCAGCAGCGGA 

G77AGAGCGACGGTACCCTGGGACGAGGC7GGCCTGGCTCGCTGTGCGAGCAGAGGCCCCTT 
rm).(ir:Tarni.rTniTnn7s.nn.'vr^n>rT^>^rT,r^r^-^~^„^ 




- — i vjvjo\_\_u\jrtA*jx I UAAUUUt- i U iLtjtJATGAATGCCATCTC 

TGGCTGGCAGGCC77C7T7CCAGTCCA77TCCAGGAGTTCAA7CC7GCCCTGTCACCACAGA 
GA7CACCCCCAGGGCCCCCGGGGGC7GGCCCTGACCCCCCCTCCCCTCCTGGTGCTGACCCC 
7CCCGGGGGGCTCC7A7AGGGGGGAGATTTGACCGGCAGGCTTCTGCGGAGGGCTGCTTCTA 
CAACGC7GACTACC7GGCGGCCCGAGCCCGGCTGGCAGGTGAACTGGCAGGCCAGGAAGAGG 
AGGAAGCCCTGGAGGGGC7GGAGGTGATGGATG7TTTCCTCCGGTTCTCAGGGCTCCACCTC 
777CGGGCCG7AGAGCCAGGGCTGG7GCAGAAGTTCTCCCTGCGAGACTGCAGCCCACGGC7 

^?^^^5ST^J^?^^'^^^^TTTGAGCAGGAGCAGGCCAATAGCACTTAGCCCGCCrrGGGGGCCC 

AGA I AGAGAA7TGT7GC7G7AT77777AAA7ATGAAAATGTrAT7AAACATGTCTTCrrGCC 
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nGlJRF ns 

xsigr.c-l peptide> 
KKL £ 5 LLAILR F 
xstart mature prote:.n> 

LILGLSLGCSLSLLRV3WIQGEGED?CVZAVGERGG?QNPD5RARLDQSDEDrKPR:VPY 

YRDPNKPYKKVLR7RYIQTELGSRERLLVAVLTSRATLSTLAVAV 

xpotential N-glycosyiation site> 

NRTVAHHrPRLLYFTGQRGARAPAGMQVVSHGDERFAWLMSETLRHLHTKFGADYD 
xhomoiogy to radical fringe -177-241> 

WFFIHQDDTYVQAPRLAALAGHLSINQDLYLGRAEEFIGAGEQARYCHGGFGYLLSRSLLLR 

lrfhz.dgcrgd:lsarpdewlgrclidslgvgcvsqkqgqqyrsfelaknrdpekec-ssaFi. 
3afa\^pvsegtlmyrlhkrfsalelerayseieqlqaqir 

xpotential N-glycosylation site> 

NLTVLTPEGEAGL5W?VGL?;i-PFTPHSRFEVLGWDYFTEQHTFSCADGAPKCPLQGASRADV 
GDALETALEQLNRRYQPRLRFQKQRLLNGYRRFDPARGMEYTLDLLLECVTQRGHKRALARR 
VSLLRP L SRV-E I LFMPYVTE ATRVQLVLPLLVAEAAAAPAFLEAFAW/LHPKEHALLTLLL 
\nf G PRE GGRGAF DP FLGVKAAAAELERRYF GTRIJVWIAVRAEAPS QVRmOWSKKHPVBTL 
FFLTTV;'7TR?GPEVLNRCRKNAISGWQAFFPVHFQEFNPALSPQRSPPGPPGAGPDPPSPPG 
?JD?S RGAF I G GR FDRQAS AE GC FYN?J}YljyuUlRLAVNWOARKRRKPLEGLEVMDVFLRFS G 
LKlrPA.VEPGLVQKFSLRDCSPRLSEELY-HRCRLSNLEGLGGRAQLAKALFEQEQANST 
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EIGURE119 

CGGAGTGGTGCGCCAACGTGAGAGGAAACCCG7GCGCG3CTGCGCTTTCCTGTCCCCAAGCC 

GTTCTAGACGCGGGAAAAATGCTTTGTGA.^AGCAGCTCCTTTTTGAAGC<^TGTGATGCTTGG 

AAGCAT.TTTGTGTGCTTTGATCACTATGCTAGGACACATTAGGATTGGTCATGGAAATAGAA 

TGCACCACCATGAGCATCATCACCTACAAGCTCCTAACAAAGAAGATATCTTGAAAATTTCA 

GAGGATGAGCGCATGGAGGTCAGTAAGAGCTTTCGAGTATACTGTATTATCCTTGTAAAACC 

CAAAGATGTGAGTCTTTGGGCTGCAGTAAAGGAGACTTGGACCAAACACTGTGACAAAGCAG 

AGTTCTTCAGTTCTGAAAATGTTAAAGTGTTTGAGTCAATTAATATGGACACAAATGACATG 

TGGTTAATGATGAGAAAAGCTTACAAATACGCCTT7GATAAGTATAGAGACCAATACAACTG 

GTTCTTCCTTGCACGCCCCACTACGTTTGCTATCATTGAAAACCTAAAGTATTTTTTGTTAA 

AAJ^GGATCCATCACAGCCTTTCTATCTAGGCCACACTATAAAATCTGGAGACCTTGAATAT 

GTGGGTATGGAAGGAGGAATTGTCTTAAGTGTAGAATCAATGAAAAGACTTAACAGCCTTCT 

CAATATCCCAGAJIAAGTGTCCTGAACAGGGAGGGATGATTTGGAAGATATCTGAAGATAAAC 

AGJTAGCAGTTTGCCTGAAATATGCTGGAGTATTTGCAGAAAATGCAGAAGATGCTGATGGA 

/^AGATGTATTTAATACCAAATCTGTTGGGCTTTCTATTAAAGAGGCAATGACTTATCACCC 

CAACCAGGTAGTAGAAGGCTGTTGTTCAGATATGGCTGTTACTTTTAATGGACTGACTCCAA 

ATCAGATGCATGTGATGATGTATGGGGTATACCGCCTTAGGGCATTTGGGCATATTTTCAAT 

GATGCATTGGTTTTCTTACCTCCAAATGGTTCTGACAATGACTGAGAAGTGGTAGAAAAGCG 

TGAATA7GATCTTTGTATAGGACGTGTGTTGTCATTATTTGTAGTAGTAACTACATATCCAA 

TACAGCTGTATGTTTCTTTTTCTTTTCTAATTTGGTGGCACTGGTATAACCACACATTAAAG 

TCAGTAGTACATTTTTAAATGAGGGTGGTTTTTTTCTTTAAAACACATGAACATTGTAAATG 

TGTTGGAAAGAAGTGTTTTAAGAATAATAATTTTGCAAATAAACTATTAATAAATATTATAT 

GTGATAAATTCTAAATTATGAACATTAGAAATCTGTGGGGCACATATTTTTGCTGATTGGTT 

AAAAAATTTTAACAGGTCTTTAGCGTTCTAAGATATGCTU^ATGATATCTCTAGTTGTGAATT 

TGTGA7TAAAG7AAAACTTTTAGCTGTGTGTTCCCTTTACTTCTAATACTGATTTATGTTCT 

AAGCCTCCCCAAGTTCCAATGGATTTGCCTTCTCAAAATGTACAACTAAGCAACTAAAGAAA 
ATTAAAGTGAAAGTTGAAAAAT 
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FTGTTT?F MO 

xsignal peptide> 

MLS£SSSFLKG\TvlLGSrFC.'\LITMLGHnUGHG 
xstart mature protem> 

NR^^HHHEHHHLQAP^^CED^.KISEDER^lELSKSFRVYCI]LVKPm 

TXHCDKAEFFSSENV'KVFESI>MDTNDMWLMMRiCAYKY^^ 

TTFAIIE^XKYFLLKKDPSQPFYLGHmSGDLEYVG^^GGIVLSVES^^KIU-NS 

EKCPEQGGMIWKISEDKQLAVCLKYAGVFAENAEDADGKDVFNTKSVGLSIKEAMT 

YHPNQ\^'EGCCSDNlAVTFNGLTPNQMHVMMYGVYKLRAf 

xpoiential X-glycosylation site> 

NGSDND 
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HGHREjn 

C C C ACG C 3T C C GAT C TTACCAACL-J^JiAai^.CTCCrGAGGAGAAAG 

AAAAA3 AG AG AG AGAG;^A?.CAAAAA.AC CAJ^AGAG AG AG AA-AArA.T GAAT T CAT C TAAAT CAT 

CTGAA_ACAXAATGCACAGAGA-GAGGATGCTTCTCTTCCCAJiATGTTCTTATGGACTGTTGCT 

GGGATCCCCATCCTATTTCTCAGTGCCTGTTrCATCACCAGATGTGTTGTGACATTTCGCAT 

CTTTCAAACCTGTGATGAGAAAAAGTTTCAGCTACCTGAGAATTTCACAGAGCTCTCCTGCT 

AC;^ATTATGGATCAGGTTGAGTCiiAGAATTGTTGTCCATTGA?.CTGGGAATATTTTCAATCC 

AGCTGCTACTTCTTTTCTACTGACACCATTTCCTGGGCGTTAAGTTTAAAGAACTGCTCAGC 

CATGGGGGCTCACCTGGTGGTTATCAACTCACAGGAGGAGCAGGAJITTCCTTTCCXACAAGA 

AACCT.'JUlATGAGAGAGTTTTrTATTGGACTGTCAGACCAGGTTGTCGAGGGTCAGTGGCAA 

TGGGTGGACGGCACACCTTTGACAAAGTCTCTGAGCTTCTGGGATGTAGGGGAGCCCAACAA 

CATAGGTACCCTGGAGGACTGTGCCACCATGAGAGACTCTTCAAACCCAAGGCAAAATTGGA 

ATGATGTAACCTGTTTCCTCAATTATTTTCGGATTTGTGAAATGGTAGGAATAAATCCTTTG 

^ACAAAGGAAAATCTCTTTAAGAACAGAAGGCACAACTCAAATGTGTAAAGAAGGAAGAGCA 

AG^^ACATGGCCACACCCACCGCCCCACACGAGAAATTTGTGCGCTGAACTTCAAAGGACTTC 

ATAJ\GTATTTGTTACTCTGATACAAATAAAAATAAGTAGTTTTAAATGTTAAAAAAAAAAAA 
AAJULV^^JiAAAAAAAjyUVAAAAJi^^ 

AJL?\AJi. 
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